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Distributions 



1. Let 0 be a field of sets E which are subsets of the line X : -®0< x < oo 
and suppose that every interval is in t. Clearly, 0 contains the Borel field 
It is also clear that the Borel field is a 0. 

Let there be given on i a (non-negative) measure \x such that the [i - 
measure of X is 1. The ^-measurability of a set can then be defined to mean 
that the set is contained in 0. This does not preclude the possibility that 
there exists a field more extensive than 0 to which [A can be extended from 0. 

It will be necessary to consider various measures, [A*, [a",... simul- 
taneously, and so it will be convenient that the fields 0', 0" , ... associated 
with them should be chosen to be identical. This can be accomplished by 
choosing 0 to be the Borel field for every |a. This will always be done in 
the sequel. Thus the decision as to whether a set is or is not measurable 
becomes independent of the choice of the measure {A. 

Accordingly, E from now on will always denote an arbitrary Borel set 
on the line X, and |A will be any function jA = |A;g, E satisfying the fol- 
lowing conditions: 

(!) Hr; 2 0; (2) H-E+F+... = !- 1 e + Hf + •••> (-) H-X = 

where E,F, ... is an arbitrary sequence of mutually disjoint Borel sets and 
X is the set of all real numbers x. 

By a distribution (on the line X : -o©< x <oo) will be meant any 
function |A = fig of E satisfying (l), (2), (3). The name is due to the fact 
that the measure, [A^,, of a given E can be interpreted as the probability 



that a point x of X is in b 
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2. Let X = Xj; be a function defined for every Borel set E in 



such a way that (l), (2) and the condition py < oo , a condition not 
implying (3), are satisfied by p = X. 

Clearly, Xj <. Xg <. X ^ for every E, if I denotes the interior, and 
C the closure, of E. Let an E be called a continuity set of X if Xj = 

For instance, if E is an interval (closed, open or neither), and if x = a 
end x = b denote its end-points, then E is not a continuity set of X if 



and only if either X & > 0 or X^ > 0 (the points a,b are thought of as 
Borel sets). It is clear that X is a continuity set of every X. 

The last remark implies that, if p^, p^,... is a sequence of 
distributions (i.e., if (l), (2), (3) are satisfied by p = p^ for 
n = 1,2,...), and if there exists a X such that, as n — > oo , the value 
p^ 11 ^ tends to the value X^, wnenever E is a continuity set of X, then 
U = X satisfies not only (l) and (2.) but (3) as well, i.e., X must then 
be a distribution. 

On the other hand, if p , p , ... is a sequence of distribu- 

(n) 

tions such that the limit of p V ^ as n — » o© tends to a limit for every 
fixed Borel set E, then the limit, when considered as function of E, need 

not be a distribution. This is illustrated by the example in which the 

f \ (n) 

distribution p' ; is the one attaining the value p x = 1 for that Borel 

(n) 

E 



set which consists of the single point x = n. The value p 



then 



3 



tends to 0 or to 1 according as E does not or does contain aninfinity 
of the points x = 1,2,... . 

Conversely, a sequence of distributions |X^,|X^,... can be such 

that there exists a distribution \i for which tends to [X for every 

E E 

continuity set E of n but not for every Borel set E. An example to this 
effect results if \i^ is so chosen that, on the one hand, every interval 
is a continuity set of (X^ 11 ) (i.e., if = 0 for every point xj cf. 

above) and, on the other hand, [X^ attains the value jx^x = 1 for the 
Borel pet consisting of the interval 0 < x < l/n. In fact, if fx denotes 
the distribution for which (X^, is 0 or 1 according as the point x = 0 is 
not or is in the Borel set E, then [X^ n ^ obviously tends or does not tend 
to |X according as E is an arbitrary continuity set of E or consists of 
the Borel set represented by the point x = 0. 

In view of the above pair of examples, the notion of a convergent 
sequence of distributions H^,|x^,... will have to be defined by the 
following requirement: There exists on X a Borel measure |X such that 

(n) 

tends to jx„ whenever E is a continuity set of [X. In this, and 
EE — ' 

only in this, case will the notation |X^ n ) (X be used. Since X is always 
a continuity set, it follows that |x^ n ) [x cannot hold unless [X is a 
distribution . 



3. Let f(x) be a Baire function (of the position x) on X and let |x 
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be a distribution. Then, if f(x) is real-valued, the set of points x 
satisfying f(x) < a, where a is any real number, is a Borel set and 
has therefore a [^-measure. 

Hence, if f(x) is any bounded Baire function on X, the Borel- 

Lebesgue integral J f(x) d|i is defined and has, by (3), an absolute 

value not exceeding the least upper bound of J f (x) ( . Here and in the 

f the 

sequel, any integration, J , in which A range E is not marked, extends 
over the Borel set E consisting of the whole line, E = X. 

If f(x) is a Baire function which, instead of being bounded, is 
non-negative, J f (x) d|X still exists but can be . It is understood 
that "non-negative" always means "real and non-negative". 

If f^(x), f 2 (x),... is a sequence of non-negative Baire functions 
on X, then, according to Fatou, 




This implies that, if every f n (x) is a non-negative Baire function on X, 
then 

(4*) J*f (x) iafj^ f Q (x) dj* 

whenever f n (x) tends to f(x) almost everywhere on X, i.e., whenever 
f n (x)— ^f (x), where f(x) < 0 °> holds for every x not contained in a set 
of (^-measure 0. It follows from (4*) by classical reasoning that, if 
the assumption f (x) > 0 of (4*) is replaced by ( f Q (x) | < const., then 
(4*) can be refined to 

(4) J* f n (x) d|* ^ f(x) d|A. 

In other words, (4) holds for every uniformly bounded sequence of Baire 

functions f n (x) which tend to f(x) almost everywhere on X. This implies 
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that, if f-^(x), f (x),... are uniformly bounded Baire functions on an 
arbitral Borel set E and tend to f(x) almost everywhere on E, then 

J f“ n (x) d^i — » J* f(x) dfj,. 

E E 

In fact, (4') follows from (4) by placing f n (x) = 0 and f(x) = 0 for 
every x contained in the complement, X - E, of E. 

Fatou’s inequality has a dual, corresponding to the case in which 
the measure function, rather than the point function, varies with n. 

In particular, (4*) is paralleled by the inequality 

(5*) ff (x) dp, <. lim inf f f(x) djj. ^ n ^ , 

J / -» \ ( 

if f(x) is a non-negative Baire function and ,p ^ , ... any sequence 
of distributions satisfying p( n ) — » jj. in the sense defined at the end of 
§ 2. On the other hand, it is not true that, corresponding to the as- 
sumptions of (4) , 

(5) J f< (x) dp^ — » J*f(x) dp 

follows from p(°) — > p for every bounded Baire function f (x) . In fact, 
if p^ n ) and p are the distributions defined in the second example of §2, 
then — * p is 'satisfied but, if f(x) = 0 for every x / 0 and f(o) = 1, 

the integrals on the left and on the right of (5) have the value 0 and 1 
respectively. However, (5) follows from p^ n ) — » p, if f(x) is a continu- 
ous bounded function on X. 

In order to prove this, it is sufficient to show that, if p^ n ) — » p, 
there exist arbitrarily large numbers a > 0 such that, if E = E fl denotes 
the interval -a < x < a, then E is a continuity set of p and 

(5*) J* f(x) d/ n ) — ► J f(x) dp 

E E 
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holds for every f(x) that is continuous on E. In fact, (5) then follows 

by subtraction, since on the one hand, [ f (x) | < const, on X and, on the 

other hand, ^ u as n — ^ qo holds when the continuity interval, 

X—E X—E 

E = E a , of (A is fixed, while ^_ E = 1 - — ? 0 holds as a — * oo in 

E = E fl . But the interval E & is a continuity set of |i whenever both 
points x = +a, thought of as Borel sets, are of jinneasure 0 (cf. § 2) , 

and (1), (2), (3) imply that the set of the points x which have a posi- 
is 

tive measure y. XA at most enumerable. Consequently, it is sufficient to 

r \ 

ascertain that (5* ) follows from n' ^ p, under the assumptions that E 
is a closed, bounded, continuity interval of fi and f(x) is continuous 
on E. But the truth of (5*) under these assumptions readily follows 
frbm the uniform continuity of f (x) on E and -from the definition of 

^ |4. at the end of § 2. (in order to see this, it is sufficient 
to approximate f(x) by a majorizing and by a minorizing sequence of 
uniformly convergent step-functions, chosen in such a way that every 
interval of constancy of each of the step-functions is a continuity 
set of |i.) 

It may be noted that (5 1 ) does not follow from |A, if 

E is an arbitrary closed, bounded interval on which f(x) is continuous. 

In fact, if f(x) = 1 for every x, then (5) claims that |A^ n ^(E) tends to 
the limi t [i(E); a claim which does not follow from the assumption 
| x (cf. § 2). 
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4. For a given distribution |Jl, let S = S ((.',) denote the set of 
points x having the property that the f,l -measure \).j of any open inter- 
val J containing the point x is distinct from 0. It is clear from (l),(2), 
(3) that the set S is closed and not vacuous, and that p„g = l. 

It was pointed out in the proof of (5') that the set of points 

x each of which has a measure u distinct from 0 is at most enumerable. 

x 

Let P = P([J,) denote the set of these points. In contrast to the set S, 
the set P need not be closed and can be vacuous. 

Let S = S(}i) be called the spectrum, and P = P(|^,) the point 
spectrum, of |a. Clearly, P is a subset of S. 

A distribution \i is called continuous if every Borel set E is a 
continuity set of p, (in the sense defined at the beginning of £ 2) . In 
particular, every point x must then be a continuity set, which implies 
that fig = 0 for every E = x, the interior of every E = x being the empty 
set (and therefore, by (l), (2), (3), a set of ^-measure 0). Accordingly, 
if |i is continuous, then its P is vacuous. It is easily verified from (1), 
(2), (3) that this condition not only is necessary but is sufficient as 
well in order that p be continuous. In other words, | x is continuous if 
and only if every enumerable set is of -measure 0. 

A distribution \i is called purely discontinuous in the other 
extreme case, namely ^in case there exists an enumerable set having the 
same ^.-measure as the whole x-line. It is clear from (3) and from the 
definition of P, that |i is purely discontinuous if and only if (!_, = 1. 
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Correspondingly, fi is continuous if and only if ^ = 0. In fact, it is 

clear from (l), (2) and from the definition of P, that, since P is at most 
enumerable, Up is 0 if and only if P is vacuous. 

vihile the above definitions involve only the measure fi itself, 
the following notions are properties of a distribution 1 1 with reference 
to another measure, namely, with reference to the ordinary, i.e. "Euclidean", 
Eorel measure (a measure which, incidentally, satisfies only the first two 
of the conditions (1), (2), (3) for a distribution, since it assigns to 
the Borel set a < x < b the measure b - a). 

A distribution \i is called purely singular if it is continuous 
and such that there exists a Borel set Z of Euclidean measure Q for which 
= where X is the whole x-line. In other words, |i is called purely 

singular if Up = 0 an# = 1, where P denotes the point spectrum of |A and 

Z is a suitable set of Euclidean measure 0. Since fig = 1 holds for every 
distribution, every continuous distribution having a spectrum S of Euclid- 
ean measure 0 is purely singular. On the other hand, if |A is pirely sin- 
gular, then S can be an interval (or, for that matter, S can be identical 

with Xj cf ^63 below) . This implies that the set Z occurring in the 
definition of a purely singular \i cannot in general be chosen to be a 
closed set, since S need not be nowhere dense. Consequently, there is 
no reasonable way of defining the set Z as a unique functional Z = Z(|a) 
of an arbitrary purely singular distribution |i. In fact, Z can be 
replaced by any set differing from Z by any enumerable set, since, [A be- 
ing continuous, not only the Euclidean measure but also the |A-measure of 
every enumerable set is 0. 
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The other extreme case of a continuous distribution is represented 
by the case of absolute continuity. A distribution |X is called absolutely 

continuous if LX = 0 holds for every Borel set E of Euclidean measure 0. 

iii 

Since P is at most enumerable, the condition |Xp = 0, i.e. the continuity 
of | x, is a necessary (but not, of course, a sufficient) condition for the 
absolute continuity of [X. According to Lebesgue's form of the Fundamental 
Theorem of Calculus, a distribution [L is absolutely continuous if and only 
if there exists on X a Baire function f such that 



(6) 






= jf(x) 

E 



dx 



holds for every Borel set E, where the integration refers to the Euclidean 
measure. The function f = f^(x) is called the density of ^ = |X E ; it is 
uniquely determined by the distribution if and only if x-sets of Eu- 
clidean measure 0 are disregarded. In this sense, the validity of both 
(l) and (6) for every E implies that the density is a non-negative function 
of x on X. According to (3) and (6), this non-negative function has a 
finite integral, ^f(x) dx = 1, over the line X. 

Let a distribution |x be called pure if it is either absolutely 
continuous or, if it is not absolutely continuous, it is either purely 
singular or purely discontimxous. According to Lebesgue, every distribu- 
tion |x admits of a unique decomposition into pure terms, 

(7) He p pV e + P 11 ^ + p n y n E , 



where E is an arbitrary Borel set, p^, p^ , p^^ are non ~ ne £ a tive constants 
whose sum is unity, are distributions belonging to the three 

pure types respectively, and all three terms on the right of (7) are uni- 
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quely determined by the given distribution [j, = pig. 



5. If = |4.£ is any distribution on the line X, let |j,(x) , where 



of those points of X which are on the left of the point x. Then it is 
clear from (l), (2), ( 3 ) that ^i(x) is a monotone function satisfying 



(the last relation, stating that ^l(x) is continuous from the left at 
every point x, follows from the fact that x itself was excluded from E x ). 
Conversely, if |i(x), — oo< x < 00 , is any given monotone function satis- 
fying (8), it is easily verified from the definition of arbitrary sets 
E contained in Borel's field, that there exists a unique set-function 

H = [*£ satisfying (l), (2), (3) and attaining the value |i(x) for every 
half-line E X . 

In view of this one-to-one correspondence, not Dnly every function 
satisfying (l), (2), ( 3 ) on Borel's field, but also every monotone 
function |i(x) satisfying (8) on tne x-line, will be called a distribution 
on X. Correspondingly, the Borel-Lebesgue integral with reference to the 
measure as defined at the beginning of §3, is identical with the 
Borel-Stieltjes integral with reference to the monotone function |x(x). 



— 00 < x <00 , denote the |j,-measure of the half-line E = E x consisting 



(8) fi(- 00 ) =0, [J,( 00 ) =1 and ^(x - 0) = [i(x) 



(9) 




X 
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whenever f(x) is a Baire function for which either integral exists (e.g. , 
for which f(x) > -const.). 

It is clear that the value of the set-function for the Borel set 
E consisting of the point x is |i(x + 0) - (.i ( x - 0) . Hence, a point is not 
a continuity set of the set-function if and only if it is a discontinuity 
point of the point-function fi(x) . In particular, (j. is a continuous distri- 
bution in the sense of §4 if and only if |i(x) is continuous on X. 

For an arbitrary distribution, the function |i(x), being non-decreas- 
ing, has a finite, non-negative, derivative ^.'(x) almost everywhere in 
Euclidean measure, and the integral J* p'(x) dx over the line X has, by 
(8), a value not exceeding 1 (Lebesgue) . In fact, J* p'(x) dx is 1 if 
and only if the point-function ^.(x) is absolutely continuous on X. This 
will be the case if and only if the set-function corresponding to \x is 
absolutely continuous in the sense of § 4> in which case (8) holds for the 
density f = p' and for every Borel set E. 

It is easy to prove that, if ... and are distribu- 
tions on X, then — » jj, holds in the sense defined at the end of § 2 

/ \ 

if and only if the corresponding point-functions, p' '(x) and [x(x), have 
the property that pi' ' (x) tends to the limit |x(x) at every continuity 
point x of |J.. 
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Transforms 

6. Let y be a point of a line - oo < y < oo . Then f(x) = e^ 3 ^, 
being a bounded Baire function on the line -co< x <oo, is a function 
for which the integral (9) exists. This integral, a function of y on the 
line -c©< y <«a, will be called the transform of the distribution which 
is represented by u on the line - oo< x < co . Thus 

oo 

(10) H°(y) = J e lx y d[i(x)j -«><y<oo, ( J=J), 

— CO 

if [i°(y) denotes the transform of |i(x). It is clear from (l) and ( 3 ) that 

( 11 ) In°(y)|<l; ( 12 ) n°(o) = 1» 

and, from (12), that J" (l - e^V) d| 4 ,(x) = 1 - |i°(y), hence 

(13) J (1 - cos yx) d^i(x) =real part of 1 - [A°(y). 

It is also seen from (10) that 

(14) i*°(y) fi°(-y) = |^°(y)| 2 . 

Let e denote the distribution whose spectrum S consists of the single 
point x = 0, i.e., for which 

(15) e(x) = 0 if x < 0 and e(x) = 1 if x > 0; so that e°(y) = 1, 
by (lO). The distribution e(x) will be called the unit distribution. 

If |i(x) is any distribution, and if c is a real number, let c ^(x) 
denote the distribution, ^(x - c) , which results if the origin of the 
x-axis is translated by c. Then it is seen from (10) that c |A°(y) = e lc ^|4,°(y). 
In particular, from (15) > 

(16) c s°(y) = e ic yj so that <V°(y) = c e°(y) ^(y). 

In contrast to the definition of the distribution c |i, let the 

distribution [J. be defined by changing the units along the x-axis, and, 
c 
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if c < 0, by a subsequent transposition of the x-axis, as follows: For 

a given distribution |i(x), let (.t(x) denote the distribution u(x/c) or 

the unit distribution e(x) according as c > 0 or c = 0; while if c < 0, 

let c |J.(x) denote the distribution which is equal to 1 - |a(x/c) when x 

is a continuity point of p, and, in accordance with (8) , is continuous 

c 

from the left at every x. It is seen from (10) that the transform of the 

distribution jj, is given by 
c 

(17) </(y) = Mcy) 

in both cases c > 0, c <0$ and (15), when compared with (12), shows 

that (17) holds in the limiting case c = 0 also. 

c 

It is clear from the definition of the two linear sheaves, jj, 

a 

and \i , just defined for every distribution \l, that the distribution \i 

b k 

is not identical with the distribution [A (not even in the case a = b). 

a 

From the case c = -1 of (17) and from (14) , 

(l**) _j^°(y) = H°(-y) = complex-conjugate of p°(y) . 

According to (16) , the absolute value of the transform of every 
c 

translation, s, of the unit distribution is constant. It turns out 

c 

that this is a characteristic property of the distributions e. More 
than this will be proved by showing that a distribution n must be of the 
form c, if [j, (y) = |x°(0) holds for a sequence of y-values which are 
not contained in an equidistant sequence, y = a + |5k, (k = 0, +1, +2,...). 
Hence, it is more than sufficient to show that, if there exists a single 
non-vanishing y, say y = y Q , satisfying ^(Jq) = H°(0) , then there 
exists a number = x (y ) such that the spectrum of fx(x) is contained 
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in the periodic sequence x = Xq + 2tik/y^. According to (12) and (ll) , this 

means that the absolute value of a transform cannot attain its maximum at two 
points unless the distribution has a spectrum contained in a periodic sequence. 

In order to prove this, suppose that corresponding to there exists 
a Jq ^ 0 satisfying J [a 0 (y^)j = 1. Then (16) implies that, if \i is replaced by 
a suitably translated distribution, C (i, it can be assumed that (.l°(y 0 ) = 1. 

But then (13) shows that, the function 1 - cos (xy^) of x being continuous 
and non-negative, 1 - cos ( xJq) = 1 must hold for every x contributing to the 

integral J* (l - cos xyQ)d(.i; cf. (l). This means that ^ must be purely dis- 
continuous, with a point spectrum contained in the sequence formed by the roots 
x of the equation 1 = cosCxy^). This completes the proof of all the statements 
made above. 



7. It will now be shown that for the transformation (10) of a distribu- 
tion p, into [A° the following uniqueness theorem holds: 



( 19 ) 



o o 

(y)~:!' l 2 (y) only when |A (x)S^i (x). 
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If (10) and (8) are applied to both monotone functions 
it is seen, by placing 0 = ^ - |i , that (19) is equivalent to the fol- 
lowing statement: 



(19*) if J|dp(x)| <cfi and Je^ dp(x) = 0, then p(x - 0) = const. 

The proof of (19*) proceeds as follows: By assumption, 0(x), 

- o»< x < 0 ® , is a function of bounded variation satisfying J* fdp = 0 for 
every function f(x) = e^ 3 ^, where y is an arbitrary real constant. Hence, 

J* fdp = 0 holds for any trigonometric polynomial f(x) = ae^ 3 ^ + be 3 "* 2 + ..., 
and so, since | dp 1 < <y» , for any function f(x) which is^uniform limit 
of such polynomials. It follows therefore from Weierstrass' approxima- 
tion theorem, that J' fdp = Q whenever f(x) is a continuous periodic 
function. It will nor; be shown that = 0 holds for f = f j also, if 

f j(x) denotes the function which is 1 or 0 according as x is or is not 

in J, where J is any bounded interval on the x-axis. This will com- 
plete the proof of (19*) , since it is clear that ^ f jdp = 0 cannot hold 
for every J unless the function p(x) , which is of bounded variation, i3 
constant on the set of its continuity points. 

Let J be fixed and let p be any positive number so large that the 

interval -p < x = p contains J. Let f j (x) denote the function which 

has the period 2p and attains, if -p < x < p, the value 1 or 0 according 

p 

as x is or is not in J. Clearly, fj can be represented as the limit 
of a uniformly bounded sequence of continuous functions each of which 
has -the period 2p. Since J* fdp = 0 holds for each of these approximat- 
ing functions, it follows from Jj dp( <0^ that J* fj dp = 0. But 
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f.T P (x) 



• f j(x) as p — ^ holds for every x. It follows therefore from 

J* | d& | < 0 ^ and | fj^j < 1 that J f jdp = 0. This completes the proof of 



(19*). 



This proof of (19) needs the assumption of (19) for -00< y < 0 ® . 

Actually, there does not exist any Const. < <jO having the property that 

(4,-^ = |4 2 is implied by the truth of = (X^Cy) f° r I y( < Const. In view 

of (l7), it is sufficient to prove this negation for the case Const. = 1, a 
case in which the truth of the negation follows by considering, for instance, 
the even functions g(y) and h(y) which are defined for - qo< y <<&> as follows: 
The function g(y) equals 1 - | y| for -1 y < 1 and has the period 2, while 

h(y) is g(y) or 0 according as -1 < y < 1 is or is not satisfied. Thus h(y) = 

Max(0,l - | y |) for every y, and so a Fourier inversion (or an integration of 
Dirichlet 1 s factor of discontinuity) shows that h(y) = J' f(x)e ixy dx, where 
- 00 < x <^P , is satisfied for -0°< y by an elementary continuous func- 

tion f(x) which is nowhere negative. Since J"f(x) dx = 1 in view of h(0) = 1, 
it follows that (6) defines an absolutely continuous distribution. If the lat- 
ter is called a comparison of (10) with the definition of f(x) shows that 

( 4 ^°(y) = h(y) for -o °< y < ^ . Similarly, if the Fourier constants of the 

function g(y) = g(y + 2) are evaluated, it is found that every a^ is non-negative 
in the Fourier series g(y) = ^ a^e^^y, where k = 0, +1,.... Thus there exists 
a purely discontinuous distribution [X by means of which g(y) is representable 
in the form (10) for -oe< y <Qo . If this |X is called then 
Hq.° (y) = M’2°(y) for -1 < y < 1 , since h(y) = g(y) for -1 < y < 1. But 
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the distributions |*^, j *2 are no ^ identical, since is continuous, while 
1*2 is not. 



8. In view of this example, it is of interest that there exist 



sufficiently small absolute value, then (*2 satisfies this relation for 
every y, and so |*2 = Hi* 

Such a distribution is, for instance, the unit distribution, e. 



property of a will become of fundamental importance, it is worthy of a 
more direct proof (a proof which, incidentally, appears to be the old- 
est of its type) . 

According to ( 15 ), the assertion to be proved is that, if |*°(y) = 1 
holds in a vicinity of the point y = 0, then (* = e. Hence, it is more 
than sufficient to prove that 

(20) if p°(y) = 1 + oCy 2 ) as y — > 0, then |* = a. 



as y — >0. It follows therefore from 1 - cos yx > 0 that, as y — > +0, 



certain particular distributions having the property that, if any 
distribution (*2 satisfies the relation |*^°(y) = [* 2 °(y) f° r every y of 



This is implied by the results found at the end of § 6. Since this 



But ( 13 ) and the assumption of (20) imply that 




i/y 



i/y 




(l - cos yx) d|*(x) = o(l) j hence 




-i/y 



-l/y 
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o 

since a (1 - cos a) > const. > 0 if 0 < +a < 1. The second relation of 
the last formula line, when compared with (l) and (15), completes the 
proof of \x = e. 

Two similar consequences of (13), to be used in the sequel, can 

be formulated as follows; If [i is any distribution, then 

r 

(21) y 2 y x 2 d[i(x) < 3 | 1 - H°(y)| f or J y| < l/r, 

-r 

where r > 0 is arbitrary, and, again for every r > 0, 

? V r 

(22) ( J " pd|i(x) < 2r J j 1 - |A°(y)/ dy. 

-cP ~ r -l/r 



First, since the real part of a complex number z has an absolute 

value not greater than I z | , it is clear from ( 13 ) that 

cO r 

(23) 

~ r - cP -r 



1 o> r 

j (1 - cos yx) df*(x) + 'I - f> “ - cos yx) d\x{x) < 1 1 - p.°(y)J J 



where y and r (> 0) are arbitrary. Both terms on the left of (23) are 

non-negative. If the second term on the left is omitted, (21) follows 

from (23) by observing that a 2 £ 3(1 - cos a) for -1 < a < 1. If the 
first term on the left is omitted, (22) follows from ( 23 ) by integration 

over the interval -l/r < y < l/r, since 
l/r 

C 2 r x 

\ (1 - cos xy)dy = r (1- ^sin ~) and 2(1 - sin 1) > J. 

-l/r 

An obvious adaption of the proof of (20) shows that 
(24) if H°(y) = 1 + O(y^) as y — ¥ 0, then J* x 2 d^(x) < o° . 
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In fact, it is clear that, if the o-assumption of (20) is replaced by 
the O-assumption of (24), then the o(l) in the formula line following 
(20) can be replaced by 0(1) | which proves that J'x^d[j,(x) <0° • 

It should be mentioned that (24) becomes false if y ,x' are re- 
placed by jyj, |x( respectively. In other words ,Jjx| dfi(x) -t>o is com- 
patible with ^°(y) = 1 + 0(|y|) as y — ?0. In fact, not even |x°(y) = 

1 + o(|y|) as y—?0 ensures thatj|x| d[x(x) < . 

In order to prove this, let p.(x) be the absolutely continuous 

p 

distribution for which the density, n’(x), is 0 or const ./(x log x) 
according as | x | < 2 or | x | > 2 (the munerieal value of const. > 0 is 
assigned by the condition ;-■( x)dx = y d\x (x) = 1). Clearly, 

( |x|d[j,(x) = oo in this case. On the other hand, from (12) and (10), 

J <70 



0 < 



1 - Ay) 



const. 

.0, 



■ fh 



cos yx 



dx. 



2 x log x 

It follows that [Ay) is an even function and that, since 0 < 1 - cos a 
_< Min(l, aP~) for -CO< a <C° , the difference 1 - (Ay) is majorized by 



a constant multiple of 

T-#- 

J X log X 
2 



"dx 



0 ° 

+ J 7 log X = 7 /log y + ° y/ioe 7 

i/y 



= 0(-y/log y) = o(y) , as y — *+0. 
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9. The uniqueness theorem, (19), states that the knowledge of the 

transform, [X°, of a distribution, [A, is sufficient for the determination 

the 

of [X itself. There arises question as to the existence of a correspond- 
ing completeness theoremj that is, of a theorem which assumes appropriate 
convergence properties of the transforms of a sequence of distributions, 
and asserts that, if the transforms have these properties, the distribu- 
tions themselves must converge (in the sense defined at the end of £ 2) . 

It will now be shown that such a completeness theorem can be formulated 
as follows* If the transforms •• ,0 ^ a sec l uence 

distributions have the property that there exists on the y=line a func- 
tion 0 = 0(y) which is continuous at the point y = 0 and is such that 

? 0(y) as n =» oe holds at every point y, then there exists 

a distribution |X satisfying [Xjj — ^ [X as n > o° ; moreover, the trans- 

form of this (X is precisely the given limit function: 

(25) n°(y)=0(y). 

The proof of this completeness theorem will require of the function 
0(y) only the property that 

l/r 

(26) r J* | l - 0(y) | dy — * 0 as r — ? o? 

-l/r 

•Since ^ n °(y) — > 0(y) for every y, it is seen from the case [X = ^ of (10), 
(11) and (12), that 0(y) is a bounded, L-measurable function satisfying 
0(0) = 1, and so it is clear that the continuity of 0(y) at y = 0 is 
sufficient for (26). Actually, it will turn out in ^10 that the milder 
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assumption, namely (26), implies the continuity of cp(j) at y = 0 and, 
for that matter, at every y. 

However, some restriction of the behavior of fp(y) at y = 0 is cer- 
tainly needed in the completeness theorem. In other words, if it is only 
assumed that there exists a function 0(y) satisfying H n °(y) — for 
every y, then there need not exist a distribution |X satisfying [X^ — ^ [x. 

An example to this effect results by placing H n (x) = [X^x/n) and choosing 
|Xq = Hp(x) to be the absolutely continuous distribution for which the 
density j-t^' (x) is 1 or 0 according as x is or is not in the interval 
0 < x <1. Then it is clear that the limit of (J^(x) as n — * O* exists 
and is 0 for every xj so that, since the constant 0 is not a distribution, 
there does not exist a distribution [X satisfying ^ > ]X. However, 

H-l°(y) = i(l - e iy )/y, by (10), and jx n °(y) = ^(ny), by (17); so that, 
if 0(y) denotes the function which is 1 or 0 according as y = 0 or y / 0, 
then [X n °(y) — » 0(y) holds for every y. Correspondingly, (26) is violated. 

In order to prove the completeness theorem formulated above, use 
will be made of Helly's selection theorem, according to which every uni- 
formly bounded sequence of non-decreasing functions contains a subsequence 
which converges to a non-decreasing function at every continuity point of 
the latter. Thus, if |X-^(x), [X ( 2 :),... is any sequence of distributions, 
it contains a subsequence, say ix^ (x) , U( 2 ) (x) , . . . , such that [^( n )(x) 
tends, as n — O® , to (x(x) for every x that is not a discontinuity point 
of |X, where |X is a certain non-decreasing function. Clearly, [X can be 
normalized so that it becomes continuous from the left at every x. How- 
ever, the preceding example implies that [X will not in general be a dis- 
tribution . 



22 



But it will now be shown that \i must be A distribution whenever the 



assumptions of the completeness theorem are satisfied. Since 
are distributions, it will of course be sufficient to show that 

OO p 

(27*) lim sup ( f - f) du (x) 0 as r — * oo . 

n — * °o J J n 

— co — r 



It follows therefore from (22) that it will be sufficient to show that 

V r 

(27) l^sup r J I 1 - ^°(y)jdy -)Oasr-»co . 

-1/r 



To this end, choose a positive number e and determine, corresoondirg to 
= c 

l/r 



it, a positive q = q satisfying 

w 



i 



1 1 - My) | dy < s for r > q 



e* 



The existence of such a q g is the assumption (26) . The other assumption is 



that H. n °(y) — > My) as n — > oo holds for every y. According to Lebesgue’s 
theorem on term-by-term integration, this implies that, if r > 0 is arbitra- 



rily fixed. 



l/r 1 /r 

r J h-Hn°(y)Uy=r J |l-0(y)|dy, 

-1/r 



-1/r 



since, in view of the case \i = [i n of (10) and (ll), the functions ^ n °(y) are 
L-integrsble and uniformly bounded. Clearly, the last two formula lines 
entail the truth of (27). 

This proves that the sequence [h^,^,... contains a subsequence u^, 
jx^,... for which there exists a distribution p. satisfying — * (J,. 



This implies, of course, that every subsequence of , M-p , . . • 



contains a 



It will now be shown 



subsequence which converges to a distribution, 
that, if a subsequence of > • • • tends to a distribution, then the 

latte:: is independent of the choice of the convergent subsequence. Ac- 
cording to the principle of the excluded middle, this will prove that 
the sequence |^,|;. 2 > ••• itself tends to a distribution. 

In order to prove that j^, is independent of the subsequence whose limit 
it is, let H'p ) >^( 0 ) • be an 7 subsequence of {.i^ , f-ip , . . . satisfying 



where y is arbitrarily fixed, it follows from the definition (10), that 



1^* ^q) (2) * * * * wnicn ^ ^ 

This concludes the proof of the completeness theorem, formulated 
before (25). 




|i for a certain distribution \i. Then, if (5) is applied to the 
sequence H'(2) , ** > anc ^ "^ 0 bhe bounded continuous function f (r.) = e 1X ^", 



~ O 

On the other hand, (y) — >6(y) 

for every y, since, by assumption, p n °(y) — ^0(y) for every y. Conse- 
quently, |r satisfies (25) for every y. But the function i(y) is independ- 




ent of the choice of the subsequence ^( 1 ) >H(2)» • • * ^l’H'o’ * * * > since 
p n (y ) — * d)(j) • It follows therefore from (25) and from the uniqueness 
theorem ( $ 7) , that ^ is independent of the choice of the subsequence 



10. It is easy to see that, if j.i and 



are arbitrary dis- 



tributions, 
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(28) H- n — » \i if and only if \i n (y)— ^ p°(y) for every y. 

This is a much easier statement than the completeness theorem, since, 
in contrast to the latter, the assertion of (28) does not contain an 
existence statement. Correspondingly, (28) can be proved without having 
recourse to (22) . 

If |J. n — - then, if (5) is applied to the bounded, continuous func- 
tion f(x) = e 1X ^, where y is fixed, it is seen that one of the assertions 
of (28) is clear from the definition (10) . In view of the completeness 
theorem of §9, the remaining assertion of (28) will be proved if it is 
shown that the function (25) is continuous at y = 0. In other words, 
it is sufficient to show that the transform ji°(y) of every distribution 
is continuous at y = 0. 

o 

Actually, the transform (.1 (y) of every \x is continuous at every yj 

moreover, every p°(y) is uniformly continuous for - Qo < y <0°. 

In order to prove this, let p°(y) denote, for every positive number 

r 

r, the contribution of the interval -r < x < r to the transform (10),; so 
that r 

(29) |Ay) p = y e ixy dn(x). 

-r 

Since the contribution of the interval -r < x < r to (x) = 1 tends to 

1 as r — ? <0* , and since | e 1X ^l = 1, it is clear that p°(y) • — |A°(y) as 

r 

r — ? 0 ° holds uniformly for - 0 »< y < 0 ° , and that it is sufficient 

to prove the uniform continuity of the function (29) for -0°< y <o° , if 
r is fixed. But, if y and h are arbitrary real numbers, (29) shpws that 
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llAy + h) r - n°(y) r .|=| I e^fe 1 * - 1 ) dn(x)| < | - l| d,..(x). 

-r -r 

Since the last integral is independent of y and tends, as h — 0, to 0 

for every fixed r, the proof is complete. 

It may be mentioned that, if [ 4 , n ~-> [4, , the transform |i °(y) must tend 
to the transform |4,°(y) uniformly on every fixed bounded y-interval. In 
fact, if (4, R • — ► \x, it is clear from the definition of [4^ — ? [4. (cf. the end 
of ^2), that (27~ ;f ) is satisfied. Hence it is clear from the definitions 
(29), (10), that it is sufficient to prove this: If (4^ — *|4,, then there 
exist certain arbitrarily large values of r such that |4^ 0 (y) — ^ (,i°(y) 

as n — > O® holds uniforEily on every fixed bounded y-interval. 

It is easy to see that there exist arbitrarily large values of r 

possessing this property. In fact, a positive number r will have this 
property whenever both x = r and x = -r are continuity points of the mono- 
tone function \L, that is, whenever the interval -r < x < r is a continuity 
set <£ [4, = [4 , e (cf. j>2). Then (5')> where = | 4 , in the present nota- 
tions, is applicable to the set E represented by the interval -r < x < r 
and to the function f(x) = e 1 ^, where y is arbitrary. This means, by (29), 
that the relation R n °(y) r — ^ |^°(y) r holds for every fixed y. In order to 
prove that it holds uniformly on every fixed bounded y-interval, it is 
sufficient to ascertain that the functions [4,-^°(y) r , (4,p°(y) T> , . . ., where r 
is fixed, are equicontinuous (Arzelh) on ever;/ such y-interval: for in- 
stance, that the functions [A n °(y) of y have uniformly bounded derivatives. 
But (29) shows that these derivatives exist and have the absolute values 
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i r ixeixy d ^ n ^ x )/ < j / x i d ^ n ( x )-> j'i x i d ^(x), 

-r -r -r 

by (5> ) . Since the last integral is independent of y and n, the proof 
is complete. 

The above results, when combined with the completeness theorem ( § 9) 
can be formulated as the following continuity theorem o f the transform (10) 
For a given sequence of distributions • • • on the x-line, there exists 

a distribution (.1 satisfying [i, Q — > \x if and only if the sequence of the 
transforms ^°(y), . is uniformly convergent on every fixed 

bounded y-intervalj in which case the function c6(y) = lira a °(y) is pre- 

n 

cisely the transform |j,°(y) of the distribution \). = lim . 
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Moments 



11. For a non-negative integer k and for a distribution p, the k-'th 
moment of p is defined to be the real number 

(30) (H) k = Jx k dp(x), if J|x|* dp(x) <oo. 

)x| dp = , then (p) k is considered, as undefined. Thus it is clear 

from (30) that the existence of the (k + l)-th moment of p is sufficient for 



the existence of the k-th. it is also clear that the O-th moment exists for 
every p, since (3) means that (p)^ = 1. 

J jc 

|x| dp < 00 and m < k, then the integral 

obtained by m-fold formal differentiation of (10) obviously is uniformly 

ni m m m 

convergent for -00 < y < «o . Hence, D p u (y), where D = d /dy , exists 

y V y 

and is represented by 

(31) D y p°(y) = i m J' e 1Xy x m d^(x), if Jl x| m dp < . 

In particular, from (30), 

(32) Dy fj. (0) = i (p) k , if J |x| dp < <50 . 

It is also clear from (31) that the k-th derivative of the transform is con- 
tinuous for every y. In particular, it is continuous at y = 0. It follows 

therefore from Taylor* s formula that 

k 

(33) H-°(y) = i m (p) m y m / ra i + o(ly| k ) as y — » 0, 

m=0 

since the assumption of (32) is satisfied for every index m <_k, if it is 
satisfied for m = k. 

Accordingly, if p has a k-th moment, then the transform of p behaves 

at y = 0 in such a way that there exist k + 1 constants c m satisfying 

k 

(33') 'ri°(y) = X c m y m + 0 (*y |k ) as y— >o. 

m=0 
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The converse of this result does not hold. In fact, if p is the 
distribution considered at the end of § 8, then [x°(y) = 1 + o(lyl) but 

J'lxldu = oo ; so that (33') is satisfied for k = 1 (by c^ = 1, = 0), 

although = (u) 1 does not exist. 

12. The index k is odd in the counter-example just given. It turns 
out that the converse of the result of § 11 is true whenever the index k 
is even. In fact, not only (33'), but- also the weaker assumption which re- 
places the o-remainder by the corresponding 0-reaainder, implies the truth 
of the converse, if k is of the form 2n. In other words, the existence of 
(p)2n is not only sufficient ( §11), but is necessary as well, for the 
existence of constants c ffl satisfying 

2n-l 

(34) M°(y) = ^ + 0(y 2n ) as y — >0. 

m=0 

In fact, it is useless to specify in (34) the term c m y m of degree m = k, 
where k = 2n, since this specification is canceled by the 0(y^ n ) i n (34). 
Thus the assertion to be proved contains, among other things, the curious 
statement that the assumption of 2n constants satisfying (34) necessitates 

the existence of a (2n + l)-st constant satisfying (33') for k = 2n. 
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If n = 0, the assertion to be proved is trivial, since (p.)^ = 1 

for every (j,. Hence, it is sufficient to show that, if the assertion is 
true for every index less than n, then it is true for n also. To this end, 
suppose that a (j, satisfies (34) for a given n. Then (34) is of course 
satisfied if n is replaced by n - 1. Since the assertion is supposed to 
be true for the index n, it follows that (u)2n-2 ex i s i s * 

According to § 11, this implies that (33) holds for k = 2n - 2. It 
follows therefore from a comparison with (34) > that c m = i m (|j,) m /m! if 
m < 2n - 1, where (^) m = J*x m dfx, by (30). Consequently, (34) can be 
written in the form 

O/ x r m 2n-l 2n 

H- (y) = > (ixy) /ml + c 2 n -iy + 0(y ). 
m=0 J 

Hence, if the real parts are taken on both sides, (10) shows that 

n-1 

J*cos yx d[A = J' (-l) k (xy) /(2k) l d^i + ay 2 ” -1 + 0(y 2n ), 
k=0 

A 1 

where a = a 2n _Q_ denotes the real part of ^ hus I S n ( x y) dja = ay 

+ 0(y 2n ) as y — » 0, if g n (q) is an abbreviation for 

n-1 

(35) g n (a) = cos a - (-l) k a 2k /(2k) 1 

k=0 

According to a standard property of the partial sums of the infinite 
series cos a = ^ (-1) a 2k /(2k)!, the function (35) does not change its 
sign for - oo < a <<*> . Hence, the function J'g n (xy) djj. of y cannot change 
its sign. On the other hand, this function of y was just shown to be of 
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the form ay + 0(y ), as y — > 0, and must therefore change its sign 

unless a = 0. Consequently, Jg n (xy) d[i = 0(y 2n ), and so, since g n (a) 
does not change its sign, J~|g n (xy) J = 0(y 2n ). Since this integral 
extends over the whole x-line, it follows that 



b/y 

_2 n p 

y J |s n (xy)| d|i(x) = 0(1) as y— »+0, 

- b/y 

where b is arbitrary. But it is clear from (35) that there exists for 
every sufficiently small b = b(n) > 0 a positive constant satisfying 

g n (a) > const. a^ n for -b < a < b. Consequently, as y — >+0, 



b/y 


b/y 


ii const. 


(xy) 2 ” dp(x) = 0(l), i.e., (* 


-b/y 


J 

- b/y 



x 2 “ d|i(x) = 0(1). 



Since b/y — * oo as y — *+0, the last relation completes the proof 
for the existence of ([^) 



13. It is clear that 

(36) J^|x|d|.i(x) <00 implies Jlx + c| d^i(x) <00 

for every real number c. This means that, if C [a( x ) denotes, as before (16), 
the translated distribution p.(x - c) , then the existence of the first 
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0 c 

moment of \i = ^ implies that the first moment of [A exists for every c 

and that 

( 3? ) = (n) x + c. 

In fact, (37) follows from (30), since (^) Q = l. 

If (n)-^ exists, let p,* denote the distribution c p obtained from p 
by the translation c = (p)^j so that 

H*(x) = H( x " (^i), i.e., ((x*) 1 = o, 
by (37). A distribution will be called "centered" if it has a vanishing 

first moment, i.e., if it is a distribution (A*. A distribution p can be 
made centered only if it has a first moment, since otherwise the distri- 
bution p* is undefined. 

IV. The rule (37) states that, if (p)^ exists, any translation of the 
x-axis subjects the value of (p)^ to the same translation. This rule 
would not have been true if, in contrast to the agreement made after (30), 
the first moment (p)^ would have been considered as defined whenever 
J x dp(x) exists as a principal value, i.e., whenever the contribution of 

the symmetric interval -r < x < r to J*x dp (x) tends to a limit as r — * 00 . 

In fact, the violation of the identity (37) in case of such principal inte- 
grals is illustrated by the absolutely continuous distribution having the 
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p _1 

even density j.i> (x) = const. (1 + x~) 
symmetric, is not centered, since 



C orre spondin gly , 
does not exist. 



this j.t, 



though 



15. Suppose that a has a second moment (;i) 0 = < <y? . mh e n 

obviously f(x + c) 2 dj.1 < o* for every c. This means that c u(x) = jj,(x - c) 
has a second moment, and therefore certainly a first moment. Thus, from 
(30), 

(39) ( C ^) 2 = (..) 2 + 2c0i) 1 + c 2 , , 

since (H-)q = 1. In particular, from (38) and (39), 



(40) 



(^’) 2 = (|i) 2 - 40-^ = j\x - ( ji )-,) 2 dj.i(x) . 



Clearly, the non-negative integral (40) is 0 if and only if x 5 

is the only point of increase of the non-decreasing function !.l(x), i.e., if 

c 

and only if ^ is a translation e of the unit distribution e. It follows 

therefore from (40) that (n*) > 0 or ((*’) = 0 according as j.i is or is 

c 

not a distribution e, i.e., according as (,1* is or is not the distribution e. 

In view of the remarks made with regard to (37) , the proviso of the 
definition (30 ) cannot reasonably be disposed of, if 3c is odd. But if k 
is even, it will froir^iow on be convenient to cancel the agreement made 

after (30), i.e. to use the notation (pi) 2n = J x^” d|.i whether (^) 2n < 0° 
or (|r) 2n = 0* • 

If this understanding is applied to the case 2n = 2, it is possible 
to define the "standard deviation" of an arbitrary distribution u, as fol- 
lows: The standard deviation, to be denoted by , is non-negative and 
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its square (the "variance" of f.l) is given by 



(41) 




^ 09 if (u.) 2 = » 

((^ 2 - (^)i 2 = ( n ’) 2 n (n ) 2 < <*; 



cf. (40). In particular, = 0 if and only if (4. = 



i ♦ s • ' 



H' 



e. 



16. Let [,i be a distribution possessing finite moments ().(,)^ of every 
index k or, what is the same thing, of every even index k. According to 
£ 12, this will be the case if and only if (33) holds for every even k, 
and therefore for every kj so that (32.) and (31) are valid for every k 
and for every real y. Accordingly, all moments of a distribution }t(x) 
exist if and only if its transform fi°(y) has a Taylor series 



CP CP 




where it is understood that this power series can diverge for every y ^ 0. 

It will now be shown that, if the Taylor series (42) happens to con- 
verge for some y / 0, and therefore on some interval, say on -q < y < q, then 
the sum of (42) must be the function f4,°(y) for every y on this interval. In 

other words, the behavior exhibited by the Taylor series, 0 + Oy + Oy +..., 

2 

of the function f(y) = exp ( -l/y ) cannot occur for functions which are 
transforms f(y) = [,i 0 (y) of distributions |.i(x). 
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In order to prove this, let f(y) be a function possessing deriva- 
tives of arbitrarily high order on an interval -q < y < q, and let p be 
any positive number less than q. Then it is known that the Taylor series 



£ 



lc ic 

D f(0)y /ki converges to f(y) on the interval -q < y < q if and only 



if, for every fixed p less than q, the maximum of the absolute value of 

k * k 

Dy f(y) on the interval -p y < p is 0(kl/p ) as k — > . In the 

present case, where f(y) = |A°(y), this condition can, by (31) > be written 

in the form 

(43) Max | d(.i(x)| = 0(kl/p x )as k — ^ © o , where \y\ < p < q. 

f k 

Clearly, the expression on the left of (43) cannot exceed \ | x| dj.i for 



any k, and, according to (32), is identical with j x dfj, for every even 
k. On the other hand, it is easily verified from Schwarz's inequality 
and from Stirling's formula, that the estimate 



(44) J | x \ d.[X = 0 (k i/p ) as k — O®, ( p < q), 

must hold for every p, if it holds for ever/ p when k is testricted to be 
even. But (30) shows that (44) can then be xvritten in the form 

(45) (|-0 2n = 0((2n)i/p )as n — * ©o , (p < q) . 

Consequently, either all or none of the three conditions (43), (44), (45) 
are satisfied for every p. 

In particular, (43) is equivalent to (45). Hence, the series (42) 

converges to the sum |.t°(y) on the interval -q < y < q if and only if (45) 
is satisfied for every p. Consequent!]* - , all that remains to be ascertained 
is that the mere convergence of (42) on the interval -q < y < q necessitates 
the estimate (45) for every p. But this is obvious, since the convergence 
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of the series (42) on the interval -q < y < q implies that the terms of 
the series form a bounded sequence of numbers for every y contained in 
the interval, and therefore for every p. 



17. Suppose that (42) converges for some 7 / 0, say for -q < y < q. 
Then the result of § 16 can be formulated as follows: Let w l»e a complex 
variable, and Let y and v denote its real and imaginary parts respectively. 

Then there exists in the circle | < q a regular function, say H-°(w), 
which becomes identical with the transform (10) on the real diameter of 
the circle. 

It will now be shown that the function |u°(w) thus defined in the 
circle (w\ < q always admits of an analytic continuation which is regular 
in the whole strip l v 1 < q about the y-axis, and that, in addition, |o.°(w) 
is a bounded function on every narrower strip, | v| < p. 

First, it is seen from ([-O^ = § x"' 11 dj.i that, if the distribution \i 

and the positive number p are arbitrary, then 

(46) £/, x|" d|.i(x) p ‘Ai = J* exp (p(x|)d|u(x), 

k=0 

where it is understood that either side can be O® . Suppose that is 

such as to make the integral (46) finite for a certain p. Then it is clear 
that the integral which results by writing w = y + iv in place of y in (10) 
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is uniformly convergent and bounded on the strip |v| < p. Hence, it is 

sufficient to ascertain that the expression (46) is finite for every 
p < q, if (42) converges for -q < y < q. But it was shown in § 16 that 
(42) converges for -q < y < q if and only if (44) holds for every p < q. 

Since this clearly is the case if and only if the series (46) converges for 
every p < q, the proof is complete. 

It follows, in particular, that a transform p°(y) must be regular- 
analytic on the whole line - 00 < y <00 whenever it is regular-analytic 
on some interval -q < y < q. This obviously implies that, if two trans- 
forms, say p^_ (y) and fx^ °(y), are identical on an interval -q < y < q on 
which n 1 °(y) is regular-analytic, then they must be identical for -oo< y < 00 • 

which means, by (19) , that p^ = P 2 . This criterion contains a substantial 
generalization of the fact deduced in 8 from (20) , namely of the fact 
that n = e if p°(y) is constant on some interval -<J < y < q. 

18. The above results have a certain, somewhat loose, connection with 
Stieltjes' problem of determinateness, and lead to the following sufficient 
condition for uniqueness: 

(47) 2n-th root of (p) 2 n < const, n. 

(A somewhat sharper sufficient condition, namely 

00 

(47') 2n-th root of (p)„ = 00 , 

n=0 ^ 

is supplied by the theory of qua si -analytic functions). 

Stieltjes found that there exist sequences of numbers which are moment 

sequences of several distributions. In other words, there exist pairs p^,P 2 
of distributions for which (p^)^ (pg)^ exist for every k and (p^)^ = (p 2 )^ 
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holds for every k, although |Xj_ ^ (Xp. Let a distribution jx be called of the 
determined type if (jx)^ exists for everyy k and if (X = A, follows for every 
distribution A. for which (A.)k exists and equals (ix)^ for every' k. A necessary^ 
and sufficient condition in order that a given |X be of the determined type 
results from the spectral theory of the infinite matrices of the continued 
fractions associated with the Stieltjes transform J'Cx - z) - -*- d[x(x) . However, 
this criterion is all too implicit to be "applicable" in general. 

In order to prove that (47) is sufficient for a [i of the determined 
type, suppose that ((X^^ and ([Xp)^ exist and are equal for every k, and let 
(p,)^ denote their common value. It is clear from Stirling’s formula that (47) 
is equivalent to the existence of a positive number p (or q) satisfying (45). 
According to § 16, this will be the case if and only if there exists a q such 
that the two power series (42) belonging to [X = and p, = (Xp converge for -q 
< y < q to |X^°(y) and to [Xp°(y) respectively. But these two power series are 
identical, since ((X^J^ = (txp)^ f° r every k. Hence, {x-^ = ^ follows from the 
criterion pointed out at the end of § 17. 

If the regular -analyticity of fx°(y) near y = 0 is replaced by its 
quasi-analyticity near y = 0, the generalization (47') of (47) follows by 
repeating, word by word, the above proof of the criterion (47). 

Incidentally, no explicit examples of distributions |X appear to be 
given in the literature which would prove that (47), (47') and a necessary 
and sufficient criterion for a distribution of the determined type are 
actually distinct. 
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Convolutions 



19. For any pair Hi = H2 = distributions, let 

Hj*H 2 = Hl*H2 ( x ) denote denote the function 

(48) Hi * = I J ^lfO dn 2 (u) 



v+u<x 



(so that Hi * H2W = H2 * Hi( x )» since (48) is symmetric). Since 

f V 

J d Hi( v ) = J dH x (v) = Hi(t - 0) - Hi(- y ) = Hi(t - 0), 



that 



vKt _ <y} CP 

it follows, by placing t = x - u and M. 

-<y 

(48*) Si! dHi(v)]« dn 2 (u) = j* Hi( x - u - 0) d^fa). 

v<x-u 

But the repeated integral on the left of (48*) is identical with the 
double integral (48). Moreover, Hi^ x - u - 0) on the right of (48*) can 
be replaced by Hi( x - u) j cf . (8) . Consequently, 

(49) Hi * H2W = ^ Hi(x - u) <4^ (u). 



Since H = Hi and h = H 2 8X6 “on-decreasing functions satisfying 
(8), it is easily seen from (48) that H = Hi * H2 is a “on-decreasing func 
tion satisfying (8). Thus (49) defines a distribution Hi * H2 f° r every 
pair of distributions Hi, H2* 

The distribution Hi * H2 I s called the convolution of Hi and H2 
It is characterized by the property that, if f(x) is any Baire function 
which is either bounded or non-negative, then 

(5°) J I f(v + u) dH x (v) dn 2 (u) = j* f(x) fa * h^( x ) 



(the double integral on the left extends over the whole (u,v) -plane) . In 
fact, (50) follows from (48) by applying Fubini*3 theorem onthe product 
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space of the lines -c»< v <00 , -0®< u <Q* which cany the respective 
measures |i^>H2* Actually, Fubini*s theorem was used even in the transi- 
tion from (48) to (48' ). Correspondingly, (50) becomes identical with 
(48) if f (v + u) is chosen to be 1 or 0 according as v + u is or is not 
less than x. 

Since y (^(u) = 1, the integral (49) can be thought of as mul- 
tiplied by 1 / J' dp^n). Then (49) means that the distribution ^ ^ Is 

the average, with reference to the weight function H2> of all translated 
distributions U ^(x) = [x^(x - u) belonging to a given distribution |^(x). 

This interpretation of the convolution of Hi and h 2 seems to 
be unsynanetric but is not; in fact, as pointed out after (48), 

(51) Hi * V-2 ~ 1*2 * Hi* 

In addition, if Hi> jig* H3 are three distributions, then Hi * H2 * H3 I s 
a unique distribution, since 

(52) (h x * H 2) * H3 = Hi * (H 2 * H3). 

In fact, if v + u and ^ ^dHi(v) dn 2 (u) in (48) are replaced by v + u + t 
and f jf d Hl(v) dH 2 (u) dn 3 (t) respectively, the associative law (51) fol- 
lows for reasons of symmetry. Another proof of (51) and (52) is contained 
in (53) below. 
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20. If (50) is applied to the bounded continuous function 
isor 

f(x) = e , where y is fixed, (10) shows that (50) becomes 

(53) Hi°(y) n 2 ° (y) = ([^ * |* 2 ) (y). 

Thus the product of the transforms of two distributions is again a trans- 
form, namely that of the convolution. 

For instance, (14) and (18) imply that 

(54) |n°(y)| =( u*|*)(y) 
for every \x. Similarly, from (l6), 

C O , C ,0 C C 

(55) fi = ( e * \i) , i.e. p = e * H, 

by (19). This means that the operation of translating jx = [x(x) into 
C \>. (x) = |x(x - c) is equivalent to the op«*ation of convoking jx = [x(x) 
with the translated united distribution e(x) = e(x - c)j a fact which 
is clear from (49) also. 

If and X n , where n = 1,2,..., are two convergent sequences of 
distributions, then 

(56) ^ n * \, if [x n -V (x and X n ~^ X. 

In fact, (56) follows by applying (53) to both * V>2 = * ^n ant ^ 

H ^ = H * X, if the resulting identities are combined with the facts 
which result when (28) is applied to all three sequences fx^ X^, * X n# 

More recondite than the "multiplication rule" ( 56 ) is the "division 
rule", according to which 

(57) Xjj — *s, if (Xjj-* }x and ^ * Xq— *■ jx, 
where e denotes the unit distribution. 
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First, ((Xjj * X n )°(y) = ^(y) X^ty), by (53). It follows there- 
for?! from (28) that, in virtue of the assumptions of (57), both limit rela- 

tions n n °(y ) — y |Ay), i J ' n °(y) ^ n °(y) — > |f(y) hold for every y. Consequently, 
X n °(y) — fl holds for every y satisfying H^°(y) ^ 0. But K n °(y) f 0 is 
certainly satisfied for every y contained in a sufficiently small interval 
-q < y < q, since, on the one hand, it was shown after (29) that every H°(y) 
is a continuous function and, on the other hand, |^°(0) ^ 0, by (12). Conse- 
quently, l n °(y) — ^ 1 for -q < y < q. It will now be shown that X n °(y) — > 1 

cannot hold for -q ^ y < q unless it holds for - 0°< y <0° . If this is 

shown, then the assertion of (57) follows from (28), since 1 = e°(y), by 
(15). 



21, The proof will be such as to apply not only to the distribu- 
tion s but to any distribution, say X, which has the property that every 
transform identical with the transform X°(y) on an interval -q < y < q is 
identical with X°(y) for - O® < y <0^ . It was shown at the beginning of § 8 
that s is such a X, and it turned out at the end of £ 17 that such a X re- 
sults as soon as X°(y) is assumed to be regular-analytic at y = 0. 

In order to facilitate references to preceding sectiais, let the 

notations Xq,! be changed to respectively. Then the assertion is that, 

o o 

if and \i are distributions satisfying [4^ (y) — > | x (y) for -q < y < q> 



44 



and if [A has the particular property that a transform cannot be identical 
with (A°(y) for -q < y < q unless it is identical with [A°(y) for -0»< y <0» 
then |A n °(y) — ^ [A°(y) must hold for y <o<» (which means, by (28), that 

H-n H 1 ) • 

The proof consists of an adaptation of the proof used in § 9. In 
order to see this, apply Helly* s selection theorem in the same way as be- 
fore (27*). In other words, choose from the sequence of the distributions 
Hl,^2i*»* a subsequence |a^j, ^(?)>*»« w* 11 - 0 * 1 converges to a non-decreasing 
bounded function at every continuity point of the latter, and normalize this 
limit function so as to become continuous from the left at every x. Letfthe 
limit function thus normalized be denoted by lim » For the present, it 
is undecided whether or not the total variation of lim ja^j is 1, i.e., 
whether or not lim is a distribution. 

In £ 9, the proof that lim is a distribution was based on 

(27), and the assumption was that (A n °(y) — > p°(y) for - & < y <'&> . How- 
ever, since l/r — ^ 0 in (27), this part of the proof remains valid under 
the present assumption, according to which [A n °(y) — > p°(y) , instead of be- 
ing valid for -o*< y , is known to hold only on some interval -q < y < q. 
In other words, it follows in the same way as in £ 9, that lira |A^ is a 
distribution. 

This means that ^ lim holds in the sense defined at the 

end of £ 2 . Hence, if f(x), ^ n ), (A in ( 5 ) are replaced by e**^, [A^, 

O o 

lim respectively, it follows from (10) that (y) — > (lim |A^ n j ( 7 ) 

for - o» < y < 0 ® . On the other hand, if -q < y < q, then, by assumption, 

^ (7) — ^H°(y)> 311(1 therefore ja^ (y) — >|A 0 (y), the sequence 
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O o 

being a subsequence of • Consequently, lim (y) = [i (y} 

holds for -q < y < q. But this implies, by the particular property 

O / \ o 

assumed for [i (y) , that lim = |i. 

Accordingly, the relation — * lim can be written in the 

form ^ |i. Since [i is given with the sequence is there- 

fore independent of the choice of the subsequence 

as in § 9, from the principle of the excluded middle, that the proof of 
H n — * \i is complete. 
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22. From here on to (62) below, let f(x), where -c°< x <O 0 i 

not 

denote a Baire function which is non-negative, even, and does A decrease 
as \x\ increases, i.e., which for every x and for every r > 0 satisfies 
the following conditions: 

(58) f(x) = |f(x)| = f (1x0 £f(jx| + r). 

There will be needed two sufficient criteria, to be expressed 
in terras of additional restrictions on f(x). The first of these criteria 
will ensure that 

(59) ^f(x) df*(x) <0» 

holds for both H- = M- x and |i = ^ whenever it holds for \l = ^ * n 2 , while 
the second will ensure the converse. It turns out that 

(60) f(x + u) = 0(f(x)) as x — ?<?>for every fixed u 



and 
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(6l) f(x + u) = 0(f(x) + f(u)) as x + u — * O* 
are sufficient criteria of the first and of the second kind respectively. 

In other words, (58) and (60) ensure that (59) holds for both 
[A = H}, and V- = (J>2 whenever (59) holds for (A = ^ * (A^, while (58) and (6l) 
ensure that (59) holds for [A = [A^ * ^ whenever (59) holds for both [i = |A^ 
and (j, = |^2 • The proofs will depend on the fact tbat,f(x) being a non-negative 
Baire function, (50) is true whether (59) does or does not hold for |A = (A-^ * V-2 
(in fact, (50) resulted by a mere application of Fubini's theorem, and is 
therefore valid even if either side of (50) becomes for a non -negative f). 



f satisfies (58) and (60), and that (59) holds for a given [A = Hi * 

The latter assumption means that the integral on the right of (50) is finite. 
Hence, if the double integral on the left of (50) is thought of as a repeated 
integral, it follows from f _> 0 that the interior integral on the 
left of (50) must be finite for at least one u. In other words, there must 
exist a real number c satisfying J"' f (x + c) dfi^(x) < O® . But then the 
identities (58) and the inequality (58), when combined with the estimate 
(60), imply that (59) holds for ja = [A^. In view of (51), this completes 
the proof of the first criterion. 

In order to prove the second criterion, suppose that f satisfies 
(58) and (61). Then the integral on the left of (50) cannot exceed a con- 



simple integrals represented by the cases |A = |A^ and [a = [Ap of the integral 
on the left of (59). Hence, if (59) holds for both \x - [A^ and (A = ^y then 



In order to prove the first of the criteria stated, suppose that 



stant multiple JJ (f(v) + f(u))d^(v) d^fa). But, since 
and J ^2 = d» this double integral is identical with the 




sum of the two 
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the integral on the left of (50) is finite. Since the integral on the 

right of (50) is identical with the case ^ # ^ the integral on 
the left of (59), the proof of the second criterion is complete. 



23. As an application, let f(x) = jxf^ , where k > 0. Then 

(53) and (60) are satisfied. In addition, (61) is satisfied, since 
k . k k 

|x + u\ < |2x| + |2u| . Consequently, both criteria are applicable. 

Accordingly, ^)x)^ <fyi(x) < C® holds for [A = (4.-^ * if and only if it 
holds for both ^ and [i = nowhere k is any positive constant. If k 

is an integer, it follows that has a finite k-th moment if and 

only if both ^ and possess finite k-th moments; in which case ap- 
plication of (32) to all three distributions gives 

(62) (Mi *H 2 >k = 21 (n^^lVMkV ( ^ 0 = 1 ' 

m=0 

if the product (53) is differentiated k times. 

For instance, (< r i^ * (.ip)-^ exists if and only if both (f.i^)^ and 
(^2)1 e:c ^ s ^» ^ w * 1 *- c * 1 case (62) gives 

( 6 3) 0*1 * H 2 >1 = (l* 1 ) 1 + (n 2 )i 

and implies therefore, by (38), that 

0*1 * 1*2) = 0*3.) " * (|*2^ 



( 64 ) 
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Similarly, (|x^ * ^ 2)2 finite if and only if both (jx-j^ an< ^ ^2^ are 
finite; in which case (62) shows that 

(65) (l*l * |X2)2 = (Hi)2 + 2 (l*i)^(|*2)l + (^)2» 
and so, in particular 

( 66 ) (fV * f * 2 *) 2 = (|*!*)2 + ( f * 2*)2 

in virtue of (64) and (68). In the contrary case, the first line of the 
definition (41) gives 

(67) * t* 2 ] 2 = M 2 + /> 2 ] 2 * 

since then both sides of (67) are o® • It follows therefore from (66) and from 
the second line of the definition (4l)> that the standard deviations satisfy (67) 
for any pair of distributions Hi>|*2* 

It is clear from the proof given in §22 for the sufficiency of condition 
(6l) that (61) can be replaced by 

(6. bis) f(x + u) = 0(f(x) f(u)) as x + u — > <>>• 

Obviously, this condition and both (58) and (60) are satisfied if f(x) = exp px, 
where p is a positive constant. Consequently, the integral (46) is finite for 
|x = jx^ * II and only if it is finite for both (x = |x^ and (x = fX£. But it was 
shown in § 17 that the integral (46) is finite for every p < q if and only if 

the transform |X°(y) is regular-analytic for -q < y < q; in which case the trans- 

form admits of a regular analytic continuation |x°(y + iv) into the strip 1 v| < q. 
Accordingly, this will be the case for the transform of ^ * pg and for a given 
q if and only if it is the case for the transforms of both [X]_ and ^2 sod for 

the same q. The "only if 1 ' part of this fact is by no means obvious from (53). 
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24. If A and B are two (not necessarily disjoint) sets on the 
x-axis, let their "vectorial sum", A (+) B, be defined as the set of those 

numbers x which are representable in at least one manner as x = a + b, 
where a and b are restricted to be in A and in B, respectively. It is 
understood that 

(68) A (+) B is vacuous if either A or B is. 

If both A and B are single points, a and b, then the vectorial addition, 
a (+) b, is equivalent to the ordinary addition, a + b. If B is a single 
point, b, then A (+) b is the set obtained from the set A by the transla- 
tion b. Generally, 

(69) A (+) B = B (+) A. 

If C is a third set, the set A (fc) B (+) C is unique, i.e., 

(70) (A (+) B) (+) C = A (+) (B (+) C) . 

If both sets A,B are closed, the set A (-t) B need not be closed. 
In fact, if A and B consist of the points a = n and b = n”^ - n respectively, 
where n = 1,2,..., then A and B are closed but A (+) B contains the points 
n~^ without containing the point 0 = lim n~\ However, it is readily shorn) 
that, if A and B are closed, then A (+) B must be closed as soon as either 
A or B is bounded. 

It is easy to see that, if S = S(|i) denotes, as at the beginning 
of § 4, the spectrum of a distribution \i t then 

(71) S([i^ * |* 2 ) = closure of S^) (+) S(|i 2 ), 
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and that the corresponding rule for point spectra is 

(72) =P(H 1 ) (+) P(|i 2 ). 

In fact, (71) and (72) follow directly from (48) and from the definitions 
of S(|i) and P(p,) in §4> if the set-functions p, = fig, used in f4, are re- 
placed by the corresponding point-functions |x = p(x), defined in £5. 

(The discrepancy between (71) and (72) is due, of course, to the fact 
that S(p) always is, while P(p) need not be, a closed set; cf. f 4)» 

In addition, 

(73) lim S([ijj) contains S([x), if — 9 p. 

Here lim Aq can be defined, for a given sequence of sets to be 

the set of those points x for which a n — ?> x as n — * o° holds for suitable 
points a^ = a^(x), a^ = agtx),... of A^jAgt... respectively. Unis lim A^ 
always exists (but can be vacuous even if none of the A n is). With this 
definition of lim Ajj, the truth of (73) is clear from the definitions of 
the symbols S(p) and ^ — » p, introduced at the beginning of § 1 + and at 
the end of £2 respectively. 

Trivial examples show that the converse of (73) is false, i.e., 
that (ijj — * p does not imply the relation lim Stf^) = S(p); and that nothing 
can be said if S is replaced by P (in fact, neither of the sets lim P(|i n ), 

P(lim Hjj), where p^ — ? jp , need be vacuous when the other set is 
vacuous) . 
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25. For a given distribution | x on the line -oo < x < oo and for 
every positive number s, let d = d(s) denote the least upper bound of 
If^x 1 ) - ^(x")| when jx' - x" I < s. Thus d(s) is a non-decreasing function 

satisfying d(oo) = 1 and d(+0). > 0, where the sign of equality, i.e. d ^0 

as s — »0, holds if and only if the distribution [X is continuous (in fact, 
since |x(-«o ) = 0 and |x(«>) = 1, it is clear that jx(x) cannot be continuous 
for every x unless it is uniformly continuous for -°o < x <°o). Incident- 
ally, it is easily seen that, whether [X is continuous or not, d(s + t) can- 
not exceed d(s) + d(t), where s > 0 and t > 0 are arbitrary. 

Since |x* - x w | = |(x' - u) - (x n - u) / for every u, it is clear that 
| fj, (x* - u) - jx(x w - u)| < d(s) whenever j x' - x" | < s. Hence it is seen 

from (49) that, if [x^ = |x, ^ = then the difference 

l|X * X(x') - [J. * l(x") | < - |x(x n - u) | dl(u), where Jdl(u) = 1, 

cannot exceed d(s). In other words, if d^ 2 (s). dj_(s), d 2 (s) denote the 
functions d(s) belonging to |X = [X]_ * [X 2 , }X]_, 1^2 respectively, then, for every 
s > 0, 

(74) d 12 (s) ^^(s), and - so < d 2 (s), 

by (51). This implies that d-^ 2 (+0) cannot exceed the smaller of the numbers 
dq(+0), d2(+0) . This mea.ns that the greatest saltus of [Xq * [X 2 cannot exceed 
the greatest saltus of |X]_ or that of p, 2 , where it is understood that the 
greatest saltus of jx(x) is the maximum of |x(x + 0) - u(x - 0) for -°o< x <00 
and can, therefore, be 0. For the latter case, (72) supplies the following 
refinement of what thus follows from (74) : 
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(75) The distribution [i^ * ^2 is continuous if and only if at least one 
of the two distributions p.^, P2 is. 

In fact, (75) follows from (72) and (68), since, according to §9, 
the point spectrum P(p) is vacuous if and only if the distribution p is 
continuous. 

The inequalities (74) represent just one manifestation of a fairly 
general, though somewhat vague, principle, according to which the function 
pi * ^2 ^- s l eas t aa "smooth” as the "smoother" of the two functions 
[^2* Other illustrations of this principle will be seen below. Still another 
manifestation of it can be formulated as follows; 

(76) If either p]_ or P2 is absolutely continuous, then p^ * [0.2 is abso- 
lutely continuous. 

In view of (75), it is of interest that the converse of (76) is false j 
in fact, it is so fundamen tally false that 

(77) p-j_ * H2 caD absolutely continuous when both p-]_ and P2 are purely 
singular (cf. $4). 

An example proving (77) will be given in § 63 below. 

The proof of (76) can be based on the formula line preceding (74). 
However, the following proof of (76) is preferable, since it supplies, be- 
sides the assertion of (76), the explicit form of the density of P]_ * P2, 
if either p^ or pg has a density (cf. §4). 

if m(x) is any distribution, then, since it is a monotone function, 
it has a finite, non-negative derivative p^’(x) almost everywhere (here and 
in the sequel, the "almost everywhere" refers to the Euclidean measure on 
the x-axis). If [^(x) is another distribution, then, since p^’(x) > 0 
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almost everywhere, the Lebesgue-Stieltjes integral 
(78) 0(x) = - u) d^u) 

exists, and is non-negative, for every x, where it is understood that> 

(78) can be o® for some x (possibly not only almost everywhere). In any 

case, an application of Fubini’s theorem to (78) gives 
x x 

(79) | 0(x) dv =J^ > ^ |y(v - u) dv^ 1 dj* 2 (u). 

-o® — qo 

This holds for any pair of distributions Suppose now that, 

while |*2 is left arbitrary, is absolutely continuous. Then the interior 

integral on the right of (79) is ^(x - u) for every x and every u. Hence, 

if (79) is compared with (49), it follows that 

x 

(SO) |i^ * [* 2 ( x ) = J* 0(v) dv 

-o° 

for every x. But the function on the left of (80) , and therefore integral 
on the right of (80), if finite for every x. Hence, the integrandof ( 80 ), 
i.e. the function (78), is finite almost everywhere. Thus (80) means that 
0 is an L-integrable function, which means, again by (80), that (*i * (*2 is 
absolutely continuous. Finally, if the Fundamental Theorem of Calculus is 
applied to (80), it follows that the derivative of [*]_ * |*2» that is, its 
density, is represented by (78) for almost all x. 

If both and |*2 are absolutely continuous, and if 0]_,02 »$ 12 ^ e ~ 
note the densities of *^2*^12 respectively, then (80) can be written in 
the symmetric form 

(81) $ 12 ( x ) = jVl( x “ u ) du * 
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Convergent Convolutions 



26. If are n distributions, (52) shows that |i]*...*p. n 

is a unique distribution,; it has the transform 

(82) (Ha*.-.*v l n ) 0 (y) = ^(y) ... y n °(y) 

and the squared standard deviation 

(83) [Hi * ... * P3 2= M 2+ "• + h L nf' 

(<<y ), as seen from (53) and from (67) by complete induction. Similarly, 

from (70) and (72), 

(84) P(m * ... * I**) = P(m)(+) ... (*) P(|^), 
while, from (71), 



(85) * ... * [ijj) = closure of S^) (+) ... (+) S^). 

Correspondingly, if c^,...,^ are real numbers, then, from (51), (52) and 



(55), 



c n c i+. . *+ c n 



(86) ^ ^ = e * * ... * 1^, 

since (15), (49) and the definition °n(x) = |i(x - c) imply that 



(37) \ * b . - 



In particular, if (^)^,...,([i n )^ all exist, then, according to (63) 
(37), 



( 88 ) 



( 



3 1 c n . y ."y , 

Hi * — * H n )i <H k ) 1 , 



t=l k=l 



and so, by (38) or ( 64 ), 



(89) (^ * ... * |* n ) = ... * 

where the existence of the moment on the left of (88) , and therefore the 
existence of the distribution on the left of (89), is part of the statement. 
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It will often be necessary to consider the total variation of 



the distributions \i = on the exterior of a fixed interval 

-r < x < r, that is, the integral on the left of (22). As n — > , 
the numerical value of r turns out to be immaterial for the purposes at 
hand, and it will therefore be chosen to be 1. As an abbreviation, put 



(90) 



f 1 1 

= M - j" d|ij so that [i = 1 - J*djx(x) > 0, 
~ o ° -1 -1 



by (l) and (3). It is easy to see that the symbol (90) cannot have any 
explicit property of the type (83). 



27. For sake of shortness, let always 

(91) £ = = ~n , while j = y . 

1 1 - ■o° 

If g^,g 2 » .. . is any sequence, the symbol^. g Q has two different 
meanings. On the one hand, it denotes just the ordered collection g^, 
gl + g g,..., which may or may not tend to a limit, and, on the other hand, 
it denotes the value represented by the limit whenever the latter exists. 
Correspondingly, if ••• is a sequence of distributions, let the 

infinite convolution * V-2 * ••• mean > on the one hand, the ordered 

collection of all distributions * ... * which may or may not tend 
to a limit distribution, and, on the other hand, let It denote the limit 
distribution whenever the latter exists, i.e., whenever there exists a 
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distribution \l satisfying * • • • * — * H- as n — ^ ^ J in which case 

the infinite convolution will be called convergent (and in the contrarjr 
case divergent). Corresponding to (91), let denote the infinite 
convolution l^i * H>2 * in either case. Needless to say, this notation 
involves for the symbol * two logically different meanings, as exempli- 
fied by the formal identity |Xq* (*Vn) = *V l n 

By a convergent convolution will be meant a convergent infinite 
convolution . 



27. If ^ and Xjj in ( 56 ) are replaced by ... * jx n and 
X-L * ... Xjj respectively, it is seen that 
(92) (%) *(*\) = *(^ * XJ 

holds for any pair of convergent convolutions * 1 ^, where the con- 

vergence of the convolution on the right of (92) is part of the statement. 

a 

It is clear from (15) and from the definition e(x) = e(x - a) 

that a n — > a holds if and only if the distribution s tends to a limit 

a 

distribution j in which case the latter is s. It follows therefore by 

a two-fold application of both (86) and of the criterion (92), that, if 

c n 

■kUq is a convergent convolution, the infinite convolution * u, is con- 

n 

vergent if and only if the series c n is; in which case 

Q 

* *V n = C e * (*V- n ), where c = c n . 



(93) 
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Another consequence of the criteria] (92) is that, if 
= * ^2 * ... is a convergent convolution, the infinite convolu- 
tion ^ n+m = * ^n+2 * * * * * s convergent for every fixed m and 

satisfies the relation 

(94) 0*i * ... * u) * (*u ) = *u . 

-*■ in' v “n+m Y n 

This is seen from (51) and (52) , if it is observed that, according to 

the case c = 0 of (55), the unit distribution e satisfies [* * e = [*, 
|**e*e = |X, ... for every f*, and that every finite convolution can 
accordingly be thought of as a convergent infinite convolution. 

If \i n is replaced by ... * [* n in the continuity theorem at 
the end of § 10, it is seen from (82) that an infinite convolution 
is convergent if and only if there exists a continuous function to 
which the product on the right of (82) tends, for -©»< y <o» , as 
n o 3 J in which case the transform of the distribution *[* n is 

(95) (•%) (y) =Tk°(y), 

and (82) tends to (95) uniformly on every fixed bounded y-interval. 

In this criterion, the product on the right of (95) cannot before- 
hand be interpreted as uniformly convergent, or for that matter con- 
vergent, in the sense of the theory of infinite products IT g n . m 
fact, the latter theory declares g n to be convergent only if there 

exists an N such that the number gjj does not vanish for any n > N and 
the partial products of \ \ g tend to a non -vanishing limit as 

n — * Q° ; and both of these conditions imposed on N are required 
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to be satisfied by a constant N, if TT Sn is called„uniforraly con- 



vergent product of functions g_. On the other hand, the above criterion 
refers merely to the uniform convergence of the partial products of the 
infinite product on the right of (95). However, these two interpreta- 
tions of the uniform convergence of an infinite product, interpretations 
which clearly are inequivalent in general, prove to be equivalent in 
the present case. 

In fact, suppose that (82) tends to (95) uniformly on every fixed 
bounded y-interval; so that the convergence of the infinite product (95) 
is not required in the sense of the theory of infinite products. Since 
(95) represents a transform, it is, by § 10, a continuous function which, 
by (12), cannot vanish for -q < y < q, if q is sufficiently small. Con- 
sequently, none of the factors on the right of (95) vanishes for -q < y < q, 
and so the infinite product (95) is uniformly convergent for -q < y < q, 
if uniform convergence is meant in the sense of the theory of infinite 
products. It follows that |.t °(y) — >1 as n — * ov holds uniformly for 
-q < y < q, if q is sufficiently small. But this implies, by the end of 
£ 20, that ' r l n ° (y) — 1 as y — 5> coo holds uniformly on fixed bounded 
y-interval. Hence, there exists for every bounded interval an N such 



that |a n (y) does not vanish on the interval whenever n > N. This proves 
that the two interpretations of the uniform convergence of an infinite 
product are equivalent in the present case. 

As an application, it is easy to show that, corresponding to Cauchy’s 



o 



criterion for the convergence of an infinite series 
convolution is convergent if and only if 
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m Vl * ... *(i n+k(n) — e,as n-?<oo f whenever k = k(n) > 0 
(that is, whenever k is a non -negative function of n)» 

In fact, (52) and (53) imply that the transform of the finite con- 
volution on the left of (96) is the product of those factors of the in- 
finite product (95) which have a subscript greater than n and not greater 
than n + k(n). Hence, if the distribution occurring in the continuity 
theorem at the end of §10 is identified with the finite convolution on 
the right of (96), it is seen from (15) that (96) holds if and only if 
the infinite product (95) is convergent on every fixed bounded y-interval. 
Since this was seen to be equivalent to the convergence of it follows 

that (96) is necessary and sufficient for the convergence of -Kjx . 

It is understood that the finite convolution (96) is meant to denote 
the distribution if k = k(n) is 0. Hence, a necessary condition 

for the convergence of is represented by 
(97) !V-> £ - 



28. It will now be shown that, if is any convergent convolu- 
tion, a point x = s is in the spectrum of the distribution *jx n if and 
only if there exists to x and to every n at least one point s n = s n (x) 
in the spectrum of ^ such that x = s appears in the form of the 

convergent series 
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(98) s = 21 V 

(This does not claim, and it is not in general true, that s n is a 
convergent series whenever s n is in the spectrum of p- n ) . It is clear 
from the definitions of A (+) B and lim A^ in ^ 24, that the assertion 
of the rule (98) can be formulated as follows: If *p, n is a convergent 

convolution, then 

(99) S(%) = lim A n , where ^ = S(^ 1 )(+) ... (+) S(p n ). 

According to $4, every spectrum is a closed set. On the other hand, 

the definition of lim A n , formulated after (73), obviously implies that 
lim A^ is a closed set. But, if \ i n and p in (73) are replaced by 

and -»p, n respectively, it is seen that the limit of the set 
(85) contains the set S(*p n ). Consequently, (99) will be proved if it 
is shown that the limit of the set (85) is contained in the set ). 

The proof will be based on the fact that, if p and X, is any pair 
of distributions, then the inequality 



( 100 ) 



(x-c 



dp *X(x)2 ^ dj; (v) . X dX(u) 

< ' r — \ vl <Jr \u-c|<Jr 



holds for every point c and for every r > 0. For a moment, grant the 



truth of (100) * 

Suppose that is convergent. Then (94) is valid (for every m). 

It implies, in view of (57), that *p n+m — > e as m — ^ oo . According to 
(15), this means that the total variation of *P n+in ( x ) on Eri y fixed inter- 
val -Jr < x < Jr tends to l,as m — ? Qo,and exceeds, therefore, J for 
every m > N, if K = N r is suitably chosen. Consequently, if (100) is 
aonlied to u = p_ * ... * p ..it- follows from (94) that 

and x = "”3 l n+m. 
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I «%<*) >i f 

1 x-c | <r \ x— c( <fr 



d 



h*' 



. .*u (x) 
r m 



when e > N 



where c and r are arbitrary. Hence it is clear from the definition of 
lin A n after (73) and from the definition of S(|x) at the beginning of 
§4, that, if c is so chosen as to be contained in lim A ffl , where 

» then c oust be contained in also. But this 

is precisely that assertion of (99) which remained to be proved. 

In order to verify the general inequality (100), apply (49) to 

(• J q " ! x t V ‘2 = 4. It is then seen, from Fubini's theorem, that the inte- 
gral on the left of (100) is identical with 




d |l(x - u)^dX(u), 
<r x 



co 





Since \ = J > this repeated integral is not less than 

d^j.i(v + c - u)ld X(u) , 

| u-cl <gr’’ \v| <r 

(the contribution of the omitted range |u - c( J> |rr being non-negative). 

Hence, in order to prove (100), it is sufficient to ascertain that, if 
u has any fixed value satisfying \ u - c l < -|r, then the interior integral 
in the last formula line is not less than the first integral on the 
right of (100) . But this is clear from (l) if it is observed that, 
since u is restricted to the open interval having the length r and the 
midpoint c, the open interval having the length 2r and the midpoint 
c - u contains the open interval having the length r and the midpoint 0. 
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29. The necessary arid sufficient conditions of ^ 27 for the 

convergence of *}X n involve convergence properties of the functions oc- 
curring in (95) or in (96). It is therefore of interest that there are 
cases in which the convergence of *1^ is ensured by the convergence of 
certain series involving moments of the first and second order only; so 
that that convergence properties of sequences of functions become re- 
placed by corresponding properties of readily available sequences of 
numbers . 

In this direction, it will be convenient to consider first the con- 
volutions -^ n in which every (ig is centered in the sense of (32) . In 
this case (to which A general case will subsequently be reduced) , a suffi- 
cient condition for the convergence of *1^ can be formulated as follows: 



(ioi) if 2. M < co , and if every |X n is centered, then is 

convergent; moreover, is a centered distribution, and 
exactly ^[^ h] . 



Since 2. W <c * implies that fj-i ^ or every n, it is clear 

that (101) is not a necessary condition for the convergence of a convolution 
of centered distributions. However, the condition <® e7 turns 
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out to be necessary for convergence in a particular case of considerable 



interest. In fact, this will be the case if the spectra of K]_ » 1^2 > • • • 
are uniformly bounded, i.e., if S(|4^) is contained in a fixed bounded 
interval for every n. Thus the assertions of (101) can, in this case, 
be completed as follows: 



(102) If • • • are centered distributions with uniformly bounded 

spectra, then the convolution % is convergent if and only if the numer- 
ical series is. 



In order to prove (101), notice first that, since *Z— Tt^nl 
implies that <0^ , it is the second line of (41) that applies to 

every n. Furthermore, (101) assumes (38) for every n. Accordingly, 

2 _ 

~~ p ^ Jlce = ’^ 1US (52) is applicable to p, = |j, n > 

if k = 2 and, therefore, if k = 1; so that D °(0) = - Ai and 

D u °(0) = 0. It follows therefore from (12) and from the case k = 2 
«/ " 

of (31) that, in view of Taylor's formula, 

^n (5 r ) = l-§ [F n Vy - 2 Y / x 2 (cos (G^ y xy) + i sin (0^ y xy) )d|,i n (x) , 
where the 0 are independent of x and satisfy the inequalities 0 < 0 < 1. 



In particular, | (A n °(y) — 1 1 <2 [^3 Y • Since “2— [ is a convergent 
series of non-negative constants, this inequality implies that the infinite 
product on the right of (95) is uniformly convergent on every fixed bounded 
y-interval. In view of J>27, this proves the first assertion of (101), 



namely the convergence of *p. n . 
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2 

The convergence of and the preceding inequality for the 

o, 

deviation of p n (y) from 1 also imply that there exists an interval 
-a < y < a such that the finite product on the right of (82) deviates 
from 1 by not more than Cy^ if -a < y < a, where both constants a, C 

are independent of the index n occurring in (82). It follows therefore 

from (95) that | (*p n ) (y) - 1 1 < Cy 2 for -a < y < a. In other words, 
the case n = 1 of (34) is satisfied by c^ = 1 and c^ = 0, if p, = *p n » 

It follows therefore from ^12 that there exists a constant C 2 satisfy- 
ing (33 1 ) for k = 2 and that (p)^ < O 0 , finally that (33) is applicable 
to k = 2. But (33’) means, in the present case, that p°(y) = 1 + 

+ °(r ) as y — ^0. If this is compared with the case k = 2 of (33) > 

it follows that c 0 = — 4-(ji) . Accordingly, the distribution p = *p~ 

has a finite second moment satisfying (*u )° = 1 - tK*lO y + ofy 2 ). 

li n 2 

This, when compared again with (33), implies that (■*p n )-^ = 0. In view 
of the definition (38), this means that(#p ) is centered. This is the 
second assertion of (101). 

Since < 00 a^d (*p n ) = 0, the second line of (41) gives 

C*Mn] = (%) . If thi s is substittited into the o-relation just 

found , it is seen that the last assertion of (101) is equivalent to 

the statement that the deviation of the product (95) from 1 is 

-iy 2 z. no 2 + o(y 2 ) as y — r'O. But the truth of this statement is 

r 2 2 

readily verified from the last formula line, since j x dp n (x) = C^ n J 
and. u ^ <00 . 

In order to prove (1C2) , let ^p n be a convergent convolution of 
centered distributions (for the present, the spectra S(p n ) need not be 
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uniformly bounded). Put X n = ^ rtere denotes, as before 

( 17 ), the distribution represented by 1 - fi n (-x) at its continuity points. 

and 

Thus it is clear that ^ is centered A that £ . It follows 

therefore from (64) and (67) that X_ = u * u is centered and that 

Xi _]_ r n r n 

M 2 = 2 W 2 - 

0 1 o 2 

Furthermore, ^ (y) = |(^ (y)| , b y ( 54 ), and so a repeated appli- 
cation of the product criterion of § 27 shows that the convergence of 
implies, on the one hand, the convergence of and, on the other 

hand, the uniform convergence of~|T (y) ° n any fixed bounded y-inter- 
o 

val. But ^ (y) is non-negative, hence real, and so it is clear from the 
case (J. = Xjj of (11) that~["^ X n °(y) must be absolutely convergent because 
of its convergence. Consequently, 21 |l - X^°(y) | < O® for every y, which 
implies, by the case |.i = of (21), that 



f 



-r 



x dX n (x) < O 9 for every r. 



Suppose now that the spectra S(|i n ) are uniformly bounded. Then it 



is clear that the same is true of the "reflected” soectra S( U ), and so, 

_l r n /7 3 

t»y ( 71 ), of the spectra S ( * P- n ) . It follows therefore from 

X n = F- n * p n that there exists an r such that each of the distributions 

X n (x) is constant for - o»< x < -r and for r < x <0* . Hence, the last 

formula line means that (X ) < 00 . But X n is centered, i.e., 

2 ^ 

(X n ) 2 = [xj ; so that % fx ^ 2 <0^ . 2 

This completes the pooof of ( 102 ), since, as observed before, W 

= 2 H • 
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30. It is easy to transcribe the results of $29 to the case 

of distributions which are not centered. In fact, (101) can be general 
ized as follows: 

2 

(103) If 2- [VJ < O' 3 , then the convolution *|A is convergent if and 
only if the series J_ f|i n ) ^ is; in which case p]C] = T and 

(n%>l = ^n) x - 

The corresponding transcription of (102) runs as follows: 



(104) If are distributions having uniformly bounded spectra, 

then the convolution *jx n is convergent if and only if both series 

2 ^ 

fha] > are * 



In order to prove (103) > suppose that 21 <0 0 , Then 

ju~l <<o®; hence (}0 <CP , and so the first moment (|.i ) and the cen- 
L nj 2 ^ 1 

tered distribution ^ n * exist for every n. But the second line of (41) 
implies that ' J , since (f.i n *)- = 0. Furthermore, if c n = (1^)-^, 

then (38) shows that ^ and ( ^ )* = u n * . Hence, if (101) and the 

criterion of (93) are applied to the case in which u> n is replaced by H n ’, 

and if (37) is then applied to \x = *u * and to c = 21 c , where c = (u ) 

11 n' n r n'i 



it is seen that (103) is equivalent to its particular case (101). 
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Correspondingly, (104) follows from (102), since the spectra S(f.i n ) 

are uniformly bounded if and only if the spectra S(jx *) are. In fact, 
it is clear from (30), where (u)^ = 1, that, if the spectrum of a distri- 
bution u is contained in an interval, then the latter contains the point 
x = ([! ) j so that the assertion is clear from (38) . 



31. For every distribution u, define another distribution,)!*, 

*0 *0 * * 

by assigning the transform (X* = \x' (y) of |X* = p/( x ) as the func- 



tion 

(105) 



:o, r - \ i xy 

l- 1 (y) = [f] + J e aj!(x) f Where (\ 4 = 

-1 1 J 



j |!f - const. 

If (12) is applied to both. [X and |X*, it is seen from (10) that the con- 
stant occurring in (105) must be the non-negative constant (90) . Cor- 
respondingly, comparison of (105) with (10) shows that the distribution 
(X* is defined as follows: 

o if lx( > l, 

(106) djx*(x) = ^ dfx(x) if 0 < | x( < 1, 

£ ’fj'j + d[x(x) if x = 0. 



It is also seen from (105) and (10) that 

Oc I 

(107) H*°(y) - |!°(y) = ( j - \ 

-Qo -I 

which implies, by (90), that 

| H- : °(y) - My) I < 2 [|xj . 



)(1 - e ixy ) Mx), 



( 108 ) 



68 



Since the spectrum of J.I* is contained in the interval -1 < x < 1, 
it follows from (104) that, for any sequence of distributions * • • » 

(109) * is convergent if and only if both ^nf and X GOl 
converge. 

Thus there arises the question as to the possibility of a connection 
between the convergence of *n n and the convergence of *p n , where 
is an arbitrary sequence of distributions. In this direction, it is easy 
to prove that 

(110) implies that either both and ^ n * are convergent 
or both are divergent. 

In fact, if and w 2 ,w 2**** are tv/o sequences of numbers 

satisfying z n — ^1 and w n — ^ 0 respectively, then log (z R + w Q ) - log z n , 
being equal to log (l + w n / z n )> is 0(\ w n \) as n — Hence, if IT % 
and^I)w n | are convergent, thenlT (z Q + w Q ) is convergent. Obviously, 
a corresponding statement holds uniformly in y for the case of functions 
z n = z Q (y), w n = v; n (y). Thus, if these functions are given either by 

z n = > z n + w n = ^n’° or z n = ^ n ’°» Z n + w n = 11 is seen froir ‘ 

the product criterion of (95), that (110) follows from (108). 
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32. Let an infinite convolution *p, n be called absolutely convergent 
if *X n is convergent whenever X-j_, i- s a rearrangement of • 

According to the criterion of (95), this will be the case if and only 
if the infinite product TT X n °(y) belonging to an arbitrary rearrangement 
of |i,^,[X2»*«» is uniformly convergent on every fixed bounded y-interval. But 
this is equivalent to the absolute and uniform convergence of the product 
(95) on any fixed bounded y-interval. This, wnen compared with (19) and with 
the fact that the absolute and the un condition convergence of a product TT§n 
are equivalent notions, implies that a convergent convolution is absolutely 
convergent if and only if the distribution *X n is identical with the distri- 
bution whenever the convolution *X n is a convergent rearrangement of the 
convolution %• 

Since the absolute convergence of TTg n is equivalent to I 1 - g n I < o o 
it also follows that *p, n is absolutely convergent if and only if Zu-^wi 
is uniformly convergent on every fixed bounded y-interval. 

If this interval is chosen to be -1 < y <. 1, and if (22) is applied 
to r = 1 and to |X = |A n , it follows, in particular, that the convergence of 
the non-negative numerical series , where is defined by (90), 

is a necessary condition for the absolute convergence of Hence, it is 

clear from (110) that the absolute convergence of *p. n implies the absolute 

» • 

convergence of and that the absolute convergence of *p. n * implies the 



70 



absolute convergence of -*i r i n whenever r #*)<«•. it follows there- 
fore from (109) that the simultaneous satisfaction of the three condi- 
tions 

(111) < oo , '2__ J <c o, < 2J\ ( 1 ^*)^ <oo 

is necessary and sufficient for the absolute convergence of an infinite 
convolution *p n . 

If a convergent product "\V gjj is written in the form I (l - 1^), 
and if _< 1 for every n, then, since > 0, it follows that IT gjj is 
absolutely convergent. If this remark is applied to = p^°(y), it is 
seen from the case j.i = ^ of (11) that, if transforms jt n °(y) are real- 
valued for every y (which means that the distributions are symmetric; 
cf. § 67 below), then the convolution cannot converge unless it con- 
verges absolutely. 



33. It turns out that^_-£p^ < c* is a necessary condition 
for the mere convergence of However, this fact is substantially 

more recondite, that is^ its particular case considered in § 32, where *p n 
is assumed to converge absolutely. In fact, it will be convenient to de- 
fer the proof to £ 39 , where use will be made of the theory of product 
measures on infinite product spaces, rather than of the theory of the 
transform (10) . 
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If the necessity of <0® ? or the convergence of has 

been proved, then (109) and (ll) clearly imply the following " three-series 
theorem": An infinite convolution is convergent if and only if all 

three numerical series 

(112) <(Vn} 1 j^n^] > ) 1 

are convergent. The easier criterion of ^32 for the absolute convergence 

of *p n , namely the criterion (ill). Is of course a particular case. Need- 
less to say, the terms of trie first two of the three series (112) are non- 



negative for every #p, n . 

The definitions (90) and (106) introduce into the t'nree-series theorem 
the absolute constant r = 1. Actually, the theorem remains valid if r = 1 
is replaced by any fixed r > 0 in (90), (106) and (112). In fact, (90) and 
(106) show that this replacement is equivalent to a change of the unit of 
length on the x-axis and therefore, in view of (17), to the reciprocal 
change of the unit of length on the y-axis. But the product (95) is uni- 
formly convergent on every fixed bounded y-interval if and only if the same 
is true of the product |i n (ry) for every fixed r > 0. Since both this 
product criterion and the convergence of all three series (112) represent 
necessary and sufficient conditions for the convergence of , the asser- 
tion follows. 

A curious corollary is that there exists for every convergent convolu- 
tion *u (x) a convergent series 7_ c such that the convolution #u (x - c ) 
n n r n n' 

is absolutely convergent. In order to see this, it is sufficient to place 
c n = -(jl n *) and to compare the three-series theorem with the criteria of 
(111) and (93); cf. (38) and (40). 
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A consequence of this coroUary is that, if (A = *jj. n is a convergent 
convolution and if X = *X n in any of its convergent rearrangements (cf. £ 32), 

then there exists to this rearrangement a constant c such that X = [A, 
i.e., X(x) = n(x - c). In order to see this, it is sufficient to choose 
convergent series z v ^ % which make the infinite convolutions 
^n^ x “ *\ 1 (x ~ b n ) absolutely convergent, and then to apply the 

remarks leading to (92) and (93) J cf. (86) and (87). 



73 



Convergence in Measure 

34. Let T be a "space" of "points" t. Let there be given on 
T a field of sets F each of which consists of points t, and suppose that 
every point t and the space T are sets F (i.e., sets contained in the field). 
Suppose finally that there is given on the field a measure 0 = 0(F) for 
which the measure of T is 1. In other words, let there be given on the 
field a function 0(F) satisfying, corresponding to (1), (2) and (3), the 
conditions that 0(F) 0 for every F, that 0(F 1 ) + 0(F 2 ) + ... = 0(F 1 + F 2 + •••) 

for every sequence of mutually disjoint sets F^,F 2 , . .., finally that 

(113) 0(T) = 1. 

Let there be given on T a function x(t) which has values represented 
by points of the line X: -0**< x < O’ and which, with reference to the 

given field on T, is a measurable function of the position t on T. It is 
understood that x(t) need not be defined, or can be + on a t-set of 
0-measure 0. 

It is easy to see that the assignment 

(114) = 0(F ) 

defines on the field of the Borel sets E on the line X a set-function 

which is a distribution in the sense of §1. It is understood that the 
symbol within the parenthesis on the right of (U4) denotes the set of 

those points t of T for which x(t) is a point x of E. Thus it is suffi- 
cient to show that, if E* denotes, as at the beginning of £5, the half- 
line consisting of those points of the line X which are to the left of 
the point x, then (114) defines for every E = E* a value which, when de- 
noted by |i(x), satisfied (8). But the expression on the right of (114) 
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is defined for every E = E X , the function x(t) having been supposed to. 

be (^-measurable on T. And the three conditions (8) for |X follow from the 
three conditions assumed, before and by (113), for the measure 0 on T, 
the function x(t) having been assumed to represent a real number x (^ + o» ) 
for almost all points t of T. 

The distribution \L = or \k = (x(x) which is defined by (114) on X 
will be called the ^-distribution of x(t) . It is characterized by the 
property that, if f(x) is any Baire function on X satisfying either 
|f (x) l < const, or f (x) > 0, then 

(115) ^f(x) = ff(x(t)) <30. 

X T 

In fact, (115) follows from (114) in either case, while (114) follows from 

(115) by placing f(x) =1 or f(x) = 0 according as x is or is not in the 
Borel set E. 



35. Let x-^(t), x^t),... be a sequence of functions on T such 
that every x R (t) satisfies, with reference to a fixed 0 = 0(F) on T, the 
same conditions as the function x(t) in £ 34. 

If there exists for every constant q > 0 and for every constant p > 0 
an N = N ^ such that the set of those points t of T at which | x(t) - x (t )/ 
exceeds p has a 0-measure less than q whenever n > H, then x^(t) is said 
to tend (on T) to x(t) in 0-measure, as n — > Q® . 
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It is known that, if x^(t) tends to x(t) in 0-measure, then the 

sequence x^(t), Xgft),... contains a subsequence, say x^(t), x^Ct),..., 
such that ('b) — ^ x(t) as n — 9 holds for every fixed t not contained 

in a certain t-set of 0-measure 0. This implies that, if x^t) tends to 
x*(t) and also to x n (t) in 0-measure, then x' (t) = x w (t) holds for almost 
all t (that is, for every t not contained in a certain t-set of 0-measure 0). 

If the functions x n (t), instead of tending to a function x(t) in 0- 
measure, have the property that there exists for every constant q > 0 and 

p 

for every constant p > 0 an N = such that the set of those points t 
of T at which / x^t) - x^Ct) | exceeds p has a 0-measure less than q when- 
ever m > N and n > N, then the sequence x-^(t), x^t),... is said to be 
convergent (on T) in 0-measure. 

It is obvious that every sequence x-^(t) , x^(t) , . . . which tends to a 
function x(t) in 0-measure is convergent in 0-measure. The fundamental 
fact (compactness theorem) of this theory is known to be the converse of 
this obvious remark, namely the fact that every sequence x^ (t), x^t),... 
which is convergent in 0-measure tends to a function x(t) in 0-measure. 

In order to establish a connection between these notions and the 
definition (114), suppose that x^t) tends to x(t) in 0-measure. This 
means that the set of those points t of T at which \ x^(t) - x(t) | exceeds 
a fixed s > 0 tends to 0 as n — * Oo . Hence, if n n and [ 4 , denote the 
0-distributions of x Q (t) and of x(t) respectively, then, since 0 is fixed, 
it is readily seen from the definition of a 0-distribution ( £ 34) > that 
[x n — > pi holds in the sense defined at the end of § 2 . This, when combined 
with the compactness theorem quoted before, proves the following fact: 
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(116) If a sequence x^(t), x^t), is convergent (on T)in (^-measure, 

and if [Ajj denotes the distribution on X which represents the ^-distribution 
of x^(t), then there exists on X a distribution [A satisfying |A n — ^|A> in 
fact, this (A is precisely the ^-distribution of the function x(t) to which 
x n (t) tends( on T)in ^-measure. 



However, (116) supplies only a necessary, and not a sufficient, con- 



dition for convergence in ^measure. In order to see this, let T be the in- 
terval -4- ^ t <.^ and 0 the Euclidean measure on it, finally x^(t) = (-l) n sgn t, 
Then ( x^t) - x fi+ ^ ( t ) \ = 2 holds for every n and for every t f- 0, and so the 
sequence x^(t), x^(t) , . . . cannotjconverge on T in ^-measure. Nevertheless, 
there exists a distribution [A satisfying [a r — [A, since |A n is independent of 
n (in fact, ^(x) is 0, \ or 1 according as x < -1, Ixl <1 or x > 1 ). 

In view of this failure of the converse of (116), it is a fundamental 
property of the particular case of infinite produce measures, to be defined 
in § 37, that the converse of (116) will, in £ 38, turn out to be correct 
for this particular case. 



36. The meaning of the convolution of distributions |A on X 
can constructively be described in terras of the notions of ^34» if use is 
made of the idea of product measures. 

In order to see this, let T^,T2 ,... be spaces of respective points 
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t^t^,... and let every T n carry a measure <£ n = <£ n (F n ) on a field of sub- 
sets F n of T n in such a way that all the conditions required before and by 
(113) in £34 are satisfied for every nj in particular, 

(U-7) 0„( T n ) = 1. 

For a fixed n, let F 0 = F-^.F^. ... . F n denote the "Cartesian product" 
of sets F^,...,F n which are chosen in the respective fields on T^,T 2 ,. . .,T n » 
Thus, 

(118) T* = T x T n 

denotes the space in which an arbitrary point 

(119) t n = t x t n 

has the coordinates t^, ...,t n ; coordinates which are arbitrary points of 
Tp . . . ,T n respectively. 

n n n n 

Consider on the product space T the "product measure" <fi = (F ), 

defined by placing 

(120) <Af ) = (^(F,) 0(F 2 ) ... 0 (F ) if F n = F, .F 0 F . 

nn lx: n 

The collection of those sets in Tti 

which are representable as product sets 

F^ does not in general form a field. However, this collection can 

always be extended so as to become a field and to possess the extremal 
property that, if a set of points (119) is in the field and has the form 

G.F 2 .F 3 F^, where G is any set of points t^ on T^, then G is a set 

F^ contained in the given field on T^; and that the same holds if is 
replaced by T^, . . .,T q . In the sequel, F D will denote any set contained 
in the resulting field of sets of points (119) . It can be shown that the 
assignment (120) defines on the field of all these sets F° a unique measure 
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n 

0 . Finally, since (119) holds for n = 1,2,..., it is clear from (117), 

(118) and (120) that, corresponding to ( 113 ), 

n 

(121) 0 (T ) = 1. 



For every n, let there be given on T_ a <b -measurable function x (t ) 

n n n n' 

of the position t n , where it is understood that x n (t n ) need not be defined, 
or can be + o° , on a t n ~set of 0 n -measure 0. Then, in virtue of (119), the 
function represented by the sum 

(122) x n (t n ) = x^tj,) + ... + x n (t n ) 

n 

is a function of the position t n on the product space T . Clearly, the func- 
tion x n (t“) has, with reference to the product measure 0° on T 0 , all the 
properties required of x^t ) on T with reference to 0 . Hence, there exists 

ii n 

on X a distribution, say representing the 0° -distribution of x (t ) on 
T**, as defined in £34. It will now be shown that, if n n denotes, for 
n = 1,2,..., that distribution on X which is the (^-distribution of x n (t n ) 
on T n , then 



n 



( (123) H' = ^ * ... * \i n . 

In fact, let T, t, 0, x(t) in (115) be respectively identified with 



(118), (119), (120), (122), and therefore \x with the 0 n -distribution of 

n, n n n 

x (t ) on T t i.e., with n . Finally, let the function f(x) occurring in 
(115) be chosen to be 1 or 0 according as x is or is not in E, where E is 
a fixed Borel set on X. Then it is clear that (115) becomes 



(124) 



n 

^E = 



f ...f 



••• 



*i(t 1 H...+x 11 ( t n ) in E 
where the n-fold integration is extended over the set of those points (119) 
of (118) which satisfy the restriction specified under the integral signs. 
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Now let (115) be applied to each of the functions x(t) = x^(t^) , . . ,,x n (t n ) 

of the positions t^,..., - ^ on the respective factor spaces T^*...,^ and to 
the respective measures - c^,...,0 n on these spaces. Then it is seen from 
(12) and from Fubini* s theorem that (124) can be written in the form 

(125) [i“ = ; -I djl^ (x^) ... d|A^ (x n ) . 

x.+...+x in E 
1 n 

But (43) shows that (125) is identical with the definition of|A^*... *|^ 
for n = 2 and therefore, as observed after (52), for every n. This proves 
(123). 



It should be noted for later reference that, if r is any positive 
constant, 

( 126 ) 



[ %. * ••• * dn 1 (x 1 ) ... 

|x 1 +...+x n l > r 



\x\ > r 



W = 



* • < fin -measure of the (t^,...,t n )set on which (t^)+. ..+3^(t n ) / > r 

This follows by applying (125) and (124) to the Borel set E consisting of the 
two half-lines exterior to the interval -r < x < r. 



37. Let 

(127) T = T r T 2 . ... 



and 
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(128) t = t r t 2 . ... 

denote what result by letting n — 9 in (117) and (118 ) j so that t is a 

point of the infinite product space. Correspondingly, let F denote any 
set in a certain field of sets in T; a field which contains every infinite 
product set F^.F^. . .., where F n is a set in the field on . It is known 
that, due to (121) , it is possible to define on T a measure 0 = 0(F) by 
placing, corresponding to (120), 

(129) 0(F) = 0(F X ) 0(F 2 ) ... if F = F 1 .F 2 . ... j 
so that, in particular, 

(130) 0(T) = 1, 
by (127) and (117). 

The function (122) of the position (119) on (118) can be thought of 
as a function x D (t) of the position (128) on (127), since the assignment 

(131) x n (t) = * 1 (t 1 )+...+3C n (t n ), where t = t^t-,. ...» 

defines on the infinite product space a single-valued function j a function 
satisfying 

n n . 

(132) x (t 1 ) = x (t n ) whenever t ! m = t t! m for every m £ n. 

In particular, the n-th term of (131) , a term which in § 36 was given as a 
function of the position on the n-th factor space T n , becomes a function 

x^t) of the position on the infinite product space T: 

(133) ^(t) = x^^), where t = VV 

it being understood that, corresponding to (132), 

(134) Xj^t’) = x n (t") whenever t* n = t M n . 
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^ n 

In y 36, the distributions |i and \i on X were defined to be the 

^-distribution of ^(tjj) and the 0° -distribution of x (t**) respectively. 
In view of (133), (134), (129), (130) and (117), the first of these defi- 
nitions is seen to be equivalent to the assignment 

(135) [Aq = ^-distribution of ^(t), 
and the second, by (132) and (131), to 

(136) |x° = (^-distribution of x** (t) , 

where 0 = 0j cj^. ... and t = t^^. ... in both cases. In particular, the 

following 

second line of (126) can be written in the A formj 

(137) 0-measure of the t-set on which x^(t) + ... + x^t) > r. 

In view of the result (123) of $ 36, where n < , it is worth em- 

phasizing that all of the above remarks involving (127), (128) and (129), 
where n =o° , hold for any sequence of measures 0^, . 02 , ... satisfying 
(117) J so that the resulting distributions (135) on X need not be such 
as to make the infinite convolution convergent, i.e., such as to lead 
to an extension of (123) to the present case, n = o® . Correspondingly, 
the series ^.x^t), in which the terms ^(t) are defined by (1 33) and 
(134) as functions of the position (128) on the infinite product space 
(127), has an n-th partial sum, x n (t), which in no sense need tend to a 
function , say x(t), as n — j in fact, the terms of this partial sum, 
which is the sum (131) , are allowed to be arbitrary within the trivial 
limitations specified before (122). 
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38. It will now be shown that the question of whether or not the 
series x^t) defines on the infinite product space T a function, say 
x(t), depends on the convergence or divergence of the infinite convolution 

An appropriate way of defining an *(t) turns out to be the convergence 
of ^ x (t) ■^ n <£~ffleasure, where 0 is the infinite product measure (129) on 
the infinite product space T of the points t = t]_.t2» .... It is understood 
that ^x^t) is said to tend to x(t) in 0-measure (on T) if the n-th par- 
tial sum of 21x (t), that is, the function x**(t) on T, tends to x(t) in 0- 
measure (on T)j cf. ^35. Then the connection just mentioned can be formu- 
lated as follows: 

(138) If the distribution ^ = |j, n (x), where - 00 < x <o» and n = 1,2,..., 
is the (^-distribution of the function x n (t n ) of the position on T n , and 
if 0 denotes the infinite product measure 0j_ 0 ^ ... on the space T = T^.Tq. ... 
of the points t = t-i.tp. ..., then the series x (t), where x (t) is de- 

fined by (133) » is convergent (on T) in 0-measure if and only if the infinite 
convolution is a convergent infinite convolution; in which case the dis- 
tribution on X is precisely the 0-distribution of the function x(t) to 
which the series ^ x Q (t) converges (on T) in 0-measure. 

In order to prove (138), suppose that ^i n is convergent. The defini- 
tion (15) and the necessary and sufficient criterion (96) show that this will 
be the case if and only if the total variation of the distribution |^ n+ ]_ * ... 

*u on the two half -lines exterior to any fixed interval -r < x < r tends to 
n+k - - 

0 as n — *0», where k = k(n) is any non-negative function of n. It is clear 



83 



from (48) and from the remarks following (52), that this total variation 

is represented by the integral of d H > n+1 (^ +1 ) ••• * extended 

over the manifold of those points of the k-dimensional Euclidean (x n+ ^> . . . , 
x n+k ) -space which satisfy the inequality 1 x^ + -^ + ... + x^jJ > r. Hence, 
if the n indices l,...,n occurring in the identity (126) and in the identi- 
cal transcription (137) of (126) are replaced by the k indices n + 1, ..., n + k, 
that 

it follows A the total variation of the distribution H n+ Q_ * ... * H'n+’j on the 
two half -lines exterior to any fixed interval -r .< x < r is identical with 
the 0-measure of the set of those points t of T which satisfy the inequality 
I z n+l^t) + ••• + x n+k (t) | > r. This inequality can be written in the form 

| x n+k (t) - x* 1 (t) ( > r, since x (t) denotes the n-th partial sum of the series 

Consequently, the infinite convolution is convergent if and only 

if the 0-measure of those points t which satisfy the inequality | x B+k (t) - x n (t)]> r 

tends to 0 as n — > o» , if r > 0 is arbitrarily fixed and k = k(n) is any 

non-negative function of n. But this means, by ^ 35, the convergence (on T) 

1 2 

in 0-measure of the sequence x (t), x (t),... on T. Since this sequence is 
the sequence of the partial sums of the series x^(t) » the proof of the 
necessary and sufficient criterion, asserted by the first part of (138), is 
complete. 

In order to prove the remaining assertiai of (138) , suppose that 

is convergent and denote by x(t) the function to which the partial sums, 
x^t), of 2Z x^t) tend on T in 0-measure. Then, if |>t and [,l n denote the 
(^-distributions of x(t) and x°(t) respectively, it follows from the last 
of the assertions of (116), that |4. n — ? |X. This means, by (136) and (123), 
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that [A = -K tA n > which proves the last of the assertions of (138). 

Remark . Suppose that the infinite convolution is convergent, 
i.e., that the series x^t) i s convergent in 0-measure. Then, as men- 

1 p 

tioned in $35, the sequence x (t), x^(t),... of the partial sums contains 

a subsequence which converges at every point t not contained in a certain 

the 

set of 0-measure 0. Actually, it can be shown that, due to^particular 

structure (133) > (134) of the terms of Z x (t), it is unnecessary to 

* — n 

select a subsequence, i.e., that convergence in 0-measure implies converg- 
ence almost everywhere in the present case. This fact, which is a straight- 
forward consequence of the definition of a Lebesgue integral with reference 
to an infinite product measure, will not be proved here, since it is irre- 
levant for the theory of the distributions [A on X or of their transforms (10), 



39. As an application of (138), it will now be proved that, as stated 
at the beginning of §33, 

(139) Z if % is convergent. 

In terms of the functions x^(t), x^(t),...and of any fixed constant c, 
define the functions u^t), u 2 (t),... by placing 

\(t) if \x n (t)|< 1, 



(140) 



u n (t) = 



c if\Xj,(t)| > 1. 
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Then it is clear that (134) remains valid if is replaced by This means 

that the function u^t) of t = t-^.l^. . .. can be thought of as a function, 
Uj/tjj), of the n-th coordinate, t^, of the point t of T = T^^. . ..; so that, 
corresponding to (133), 

(141) u n (t) = where t = t-^.t 2 

Hence it is easy to show that 

(142) if ^(t) is convergent in ^-measure, then \ u^, (t) is, 

where 0 denotes the product measure (129) on (127). 



In fact, suppose that x^(t) is convergent (on T) in 0-measure. 
This means, by £ 35, that there exists for every q > 0 and for every p > 0 

p 

an N = such that the 0-measure of the set of those points t at which 

K(t) + ... + x m (t)l exceeds p has a 0-measure less than q whenever 

m > n > N. In particular, the set of those points t at which |xj(t)| 

exceeds p has a 0-measure less than q whenever j > N. This, when combined 

with (l40), shows that the set of those points t at which u. (t) = x.(t) is 

J J 

violated by at least one j > N has a 0- measure less than q, provided that 
p is chosen sufficiently small (p < 1). It follows therefore from the con- 
dition defining N = , that the set of those points t at which l u n (‘t) + 

... + u (t) | exceeds p has a 0-measure less than 2q whenever m > n > N, 
provided that p is sufficiently small. This, when compared with £>35, com- 
pletes the proof of (142) . 

According to (133) and (141), both x^ and are functions of the 

position t on the space T„ on which the measure is 0 = rf> (F ) . Hence, if 
n n r n na n 

H n = [A n (x) and = X n (x) respectively denote the 0 n -distributions of ^(t^) 
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an ^ Ujj ( t n ) , it is seen from ( 140 ) , by applying the definition ( 114 ) once 
to (J, = (J, n , 0 = 0 n> x ( t ) = 3^ and once to |i = ^, 0 = 0 ^, x (t) = u n , that 
X n (x) is the distribution for which 



dy x ) = 



0 if | x | > 1 and x/c, 
djijjCx) if | x| <_ 1 and x/c, 

+ if x = c, 

where denotes the non-negative value defined by (90). In particular, 



the spectrum S(X n ) is situated between the two points x = +(l + | c|) for 
every n. It follows therefore from (104) that a necessary condition for 
the convergence of *X n consists in the convergence of the series (\j)^» 
where (X R )^ = j x dX Q (x) , by (30 ) 5 so that, according to the preceding 
representation of dX ( x ), l 

r 



&n) 





xdu ). 
J n 



-1 



Suppose now that, as assumed in (139), the convolution * 1 ^ is con- 
vergent. Then, if (133) is applied to both series ^ x n (t) , ^ u n (t) , 
it follows from (142) that the convolution *X n is convergent. Hence, the 
series (X^)^ must converge. Consequently, if the arbitrary constant c 
occurring in the last formula line is chosen to be c = +1, it follows by 
subtraction of the resulting two series ^ (X n )^, that 2 * t>e i n S 

the difference of these convergent series, is convergent. Since > 0, 
this is identical with the assertion of (139). 
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40. This proof of (139), or any other application of (138) to in- 
finite convolutions, assumes that, if a sequence of distributions . . . 

on X is given, it is possible to find spaces on which there 

exist functions x^t^, x,^),... and measures 0^(F^) , 311011 

that (117) is satisfied and (J^ becomes the (^-distribution of x^t^) . 

But this situation can be realized for any given sequence • •• ^ 

placing 

(U3) ^(^n) = Xq, t n = x n , ^ = ^ n . 

In other words, the sequence of the spaces is chosen to be iden- 
tical with a sequence X-^: -o«< x^ <c/> $ Xj : -o»< 3^ <®» ; ... of 

copies of the line X : -©*< x <o° , and the given distribution 
[4^ on the line X is chosen to be the measure ^ on T n = Xq 

(cf. (1), (2), (3) and the assumptions made at the beginning of £34)» It 
is understood that the field of the (^-measurable sets F n on the space T n 
becomes the field of the Borel sets on the line X^. Thus the infinite 
product measure (129) is defined by the condition 

(144) <£(%. Ejj. ...) = ^(E^) <^( Eg) ..., where 0 n = 

and will, in accordance with (91), be denoted by <p = H n . It applies to 

sets contained in a field generated by the sets ••• of the infinite 

product space X^.X^ The latter is the Cartesian space with an in- 

finite sequence of coordinates, one point of this space being 

(145) x l ,x 2* ••• = ( x i»* 2 , •••), where - o®< Xjj < 0 ® . 
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It is clear from the definition (114) of a (^-distribution, that 
(138) can, in the present case, be formulated as follows: 

(146) An infinite convolution is convergent if and only if the series 
y* Xjj is convergent in \\ ^-measure on the infinite product space X-^.X^. ... 
of the points (145) > where *\"f denotes the infinite product measure de- 
fined by (144). If *]X n is convergent, then the distribution [X = |Xg or 

fx = |x(x), which is represented by * 1 ^ on the line X: - ©°< x <«» ,can be 

obtained as follows: The function 3^, to which the series ^ con- 

verges on X 1 .'X^. ... in 7T [X^-measure, attains values x = * n contained 
in E on an (x-^,X 2 >»»»)-set having the "[J ^-measure where E is any Borel 
set on the line X. (it is understood that the function remains tin- 

determined on an (x^,X 2 »...)-set of \\ ^-measure 0). 

The assertions of (146) can in various ways be combined with Borel* s 
celebrated alternative of "either everything or nothing" •, in fact, when such 
a combination is possible, (146) supplies a corresponding alternative with 
regard to properties of infinite convolutions *jx . An instance of results 
of this type will be given in £ 41 . 

For the applications in question, Borel* s disjunctive principle can 
be formulated as follows: 

(147) Let "(X |x n denote the infinite product measure on the infinite product 
space Xjl»X 2 . ... of the points (145), where [X^,^,... is any sequence of dis- 
tributions on a line X (the infinite convolution need not be convergent). 
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Let Q and R be two complementary "qualities" such that, except for an 
(xi> x 2 ,,# *) -set 0 f TT Rn-oeasure 0, every point (x^x^,...) of ... 

possesses exactly one of the two properties Q,R. Suppose further that, 
if (x^', x^',.*.) and (x^n >x ^«, ...) are two points not on the excluded set 
of 1'T ^-measure 0, and if there exists to these two points an N such that 
x n ' = x n " for every n > N, then either Q or R is assigned to both points. 
Finally, let (Q) and (R) respectively denote the set of those points 
(x^,X2,...) of ... to which Q and R are assigned. Then (q), hence 

(R) , must possess either the 1 I ^-measure 0 or the (^-measure of the 
whole (x 1 ,X2,...)-space, i.e., the ”\T [^-measure 1. 

The proof of this alternative depends on a detailed inspection of 
the constructive definition of an infinite product measure? it will not be 
given here. Incidentally, the fact quoted in the Remark at the end of ?38 
can readily be obtained from this alternative. 



41. As an application of (146) and (147) together, the following 
fact will now be proved: 

(148) If every distribution occurring in a convergent convolution *)A n 
is purely discontinuous, then the distribution is pure (it can be any of 
the three pure types defined in ^4). 
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First, the assumption that every [X^ is purely discontinuous means, 
t>y ^4, that, if P n ^, Pjj 9 • • • 3.S the (infinite or finite, but certainly 
not vacuous) sequence of the points x contained in the point spectrum, 

P(fX n ), of (Xq, and if 0 = <^(E) denotes the set-function |X = [X^, belonging 

1 2 

to the point- function fx = jx(x) when |x = jXjj, then ^(p^ i ) + 0 n (P n ) + ••• = 1 

holds for ever y n (the points x = p Pn^» ••• are ^ ere thought of as Borel 

sets E on the line X : - © t> < x <o»). Since, according to (143) and (138), 

the (^-distribution of x^ is this implies that the set of those points 

1 2 

x of the line X which are distinct from all the points x=p n , p n , ... 
is of (^-measure 0 for every fixed n. 

Let M denote the least modul containing every point of every P(|x n ). 
This means that M is the least (at most enumerable) set containing the 
value x = 0 and every value x = p n , and having the property that it con- 
tains the sum and the difference of any two values contained in it. Thus 
it is clear that, if M(+)E denotes, as in § 24, the vectorial sum, then 
M(+)E is at most enumerable whenever E is, and that M(+)E is of the Euclidean 
measure 0 whenever E is ( by " Eu c 1 i d e an measure” is meant the ordinary Lebesgue 
measure on the x-line). 

Suppose now that is convergent. Then, according to (146), the 
series x R converges on X^ . ... in TT d> n -measure, where cb^ = p n , and 
the function ^ of ( 145 ) , thus defined on X^.Xp. ...» has the T7*0 n “ 
distribution *p n . On the other hand, the purely discontinuous distribution 

^n = on X is the ^-distribution of the function x n of the position x^ 
on Xjj. Hence, it is clear from the definition of the modul M, that the set 
M(+)E belonging to an arbitrarily fixed x-set E has the following property: 
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If the function x n attains at two points (145) of the infinite product 

,...), the values 

x* and x w respectively, end if the two points are such that there exists 
an N for which x^’ = x^" holds whenever n > N, then the value x = x' cannot 
be contained in the x-set M(+)E unless the value x = x" also is contained 

in M(+)E. This makes applicable the alternative expressed by the assertion 
of (147). 

In fact, as pointed out before, M(+)E is at most enumerable whenever 
E is, and M(+)E is of Euclidean measure 0 whenever E is. Hence, *p n being 

the \\ ^-distribution of the function ^ on it is clear 

from (147) and from the definition of a 0-distribution (cf. |>34)» that, 
if E is either at most enumerable or of Euclidean measure 0, then the dis- 
tribution p = jig represented by p = #1^ attains for the set G = M(+)E a 
value which must be either 0 or 1. Furthermore, since the point x = 0 
is contained in M, it is clear from the definition of a vectorial sum ( j> 24), 
that every E is a subset of the corresponding G = M(+)E. Thus it is clear 
that there are only the following possibilities left: The distribution 

[a = fij,, where p = '^p^, attains 

(i) the value p^ = 1 for a suitable enumerable set E; 

(ii) the value 0 for every enumerable set E but the value fJg = 1 for a 
suitable Borel set E of Euclidean measure Oj 

(iii) the value C for every Borel set E of Euclidean measure 0. 

■Since (i), (ii) and (iii) are precisely the properties which, accord- 
ing to £ 4 , define a distribution p to be purely discontinuous, purely sin- 
gular and absolutely continuous respectively, it follows that p = -fcp^ is 



I 1* X 2' 



• • • 9 



say at its points (x^>, x^',*-*) and 
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pure. The remaining assertion of (142), namely that all three cases can 
actually occur, is shown by simple examples j cf., for instance, $ 62 
below. 
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Semi-groups 



42. If denotes, as in (123), the n-th partial convolution of an 
infinite convolution *p, n , then [j. n+1 = | 4 , n * H n+1 , by (52). In what follows, 
this process will be extended to the case in which the index n is replaced 
by a continuous parameter. The distribution sequenees (123) relate to the 
resulting continua of distributions in the same way as the multiplication 
of numbers relates to the multiplication of elements in a one-parameter 
semi-group. 

As at the beginning of §2, let X be any non-decreasing bounded func- 
tion, normalized by 

(149) X(- o® ) = 0 and X(x - 0) = X(x) ; X(o» ) <0° 

(it need not be a distribution, since (149) allows that the second of the 
three conditions (8) be violated by [J, = X ) . It will be shown that, whether 
X is or is not a distribution, there exist to X two distributions, say 
and [ijj, such that, if and are denoted by exp X and Exp X, the trans- 
forms Hj°(y) = (exp X) (y) and (y) = (Exp) (y) have logarithms given by 

(150) log (exp X)°(y) = S (e iXy - 1) dX(x) 
and 

(151) log (Exp X) (y) = Jx - 1 - ixy) dX(x) 

respectively. It is understood that, corresponding to the relation 

•-2 z 

z (e - 1 - z)— where z— ^0, the integrand of (151) is meant to be 

(152) x~ 2 (e iXy - 1 - ixy) = -|y 2 if x = 0 

(whether y ^ 0 or y = 0); so that the integrand of (151) is a continuous 
function of x and y together. Needless to say, exp and Exp denote operators. 
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turning the function X = X(x) into certain distributions, [Xj = |x^(x) and 



|X^ = |x^(x). On the other hand, log w in (150) and (151) denotes, of 
course, a logarithm of the complex number w, and the branch of the loga- 
rithm is assigned by the single-valued expressions on the right of (150) 
and (151). 



43. For instance, if X = e, the assertions are that there 



exist two distributions \l, say [X = if = tf(x) and |x = 9 = 9(x), for which the 
transform (10) becomes 

J-7 l 

. o e — 1 

(153) « (y) = e , where a = exp e, 

and 

2 

(154) 9°(y) = e , where 9 = Exp s, 

respectively. In fact, if X is the unit distribution (15), then, while the 



ly ^ 

integral (150) reduces to e - 1, the integral (151) becomes -gy in virtue 

the 

of (152). Actually distributions to be defined by (153) and (154), the 
"Poissonian" distribution |X = # and the "Gaussian" distribution [X = 9, turn 
out to be such as to lead, by a process of stratification, to the distri- 
butions exp X, Exp X belonging to an arbitrary X = X(x). In fact, it will 
be shown that exp X and Exp X result from a and 9 for every X, if an averag- 
ing process (to be carried out with reference to an arbitrary weight-function, 



2 . 
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depending on the arbitrary \) is combined with the two trivial processes 

defined by (17) and (l6)j and so, in particular, with the convolution 

process belonging to the translated unit distribution 
C CO 

(155) e(x) = e(x - c), where log e (y) = icy. 

The existence of the two fundamental distributions, it = exp s and 

6 = Exp e, can be proved directly, as follows: If n runs through all non- 

negative integers, the sum of the reciprocal values of the positive num- 
bers nle is 1. Hence, there exists a distribution jx = [i(x) which has at 
x = n the jump l/(n!e) and is constant for - oo< x < 0 and for n < x < n + 1, 
where n = 0,1,... . According to (10), the transform |^°(y) of this purely 
discontinuous distribution is the sum of all terms e^ n ^ r /(nle). Since this 
infinite series has the sum e w /e, where w = it follows that (153) is 
satisfied by the distribution just defined. This proves the existence of 
the distribution it. It also follows that, the function (153) of y being 
regular-analytic at y = 0, the distribution a has finite moments (tf)^ of 
arbitrarily high order (cf. the beginning of § 16) . In particular, from 
(32) and (153), 

(156) («) 1 = 1, (#) 2 = 2j hence = 1, 

by (40). Moreover, #*(x) = #(x - 1), by (38). According to (16), this means 
that it*°( y) = e^^?(y) . It follows therefore from (153) that #*°(y) = 

1 ~ h + °(y ), hence log *’°(y) = -|y 2 + o(y 2 ), as y_ *0. This obviously 

o -4 

implies that, as n — * Co , the function n log (n ^y) of y tends to the 
function -^-y uniformly on every fixed bounded y-interval, which means that, 

.0 -4 

on every such interval, the n-th power of the function it (n 2 y) of y tends 
to the exponential (154) « It follows therefore from the continuity theorem, 



formulated at the end of / 10, that, in order to prove the existence of a 

distribution 9 satisfying (154) » it is sufficient to ascertain that the n-th 

.o — i 

power of the function a (n y) of y is the transform of a distribution for 
every n. But this is clear from (82) and (17), since the function #*°( y) of 
y is the transform of a distribution, a' . Thus the distribution 9 appears 
reduced to the distribution a, (in this regard, it is irrelevant that 9 
will, in | 54 , be given explicitly) . 



44. If X(x) = const., then, on the one hand, const. = 0 in view 
hand, 

of (149) and, on the other A both integrals (150), (151) vanish for every y. 

It follows therefore from (15) that, if X(x) = const., the distributions as- 
signed by (150) and (151) are 

(157) exp 0 = e, Exp 0 = e. 

If the existence of the distributions assigned by (150) and (151) has 
been established for the non-decreasing bounded functions X = X-j_ and X = X£> 
then the existence of the distributions assigned for the non-decreasing 
bounded function X = X^ + X 2 follows from 

(158) exp(X x + X 2 ) = (exp X 1 )*(exp X 2 ), Exp (X-,^ + X,,) = (Exp X 1 )*(Exp X 2 ). 
This is clear from the product rule (53) » since, on the one hand, both assign- 
ments (150), (151) are logarithmic and, on the other hand, both are such as 



to be additive in X. 
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If a non-decreasing bounded function has a jump at x = 0, then it 

obviously has the form pe + X, where X is a non-decreasing bamded function 
having no jump at x = 0, and p is a positive constant, representing the 
jump given at x = 0. Since (154) and (17) imply the existence of the dis- 
tribution Exp (ps), it follows from (158) that, in the proof for the 
existence of the distribution Exp X for an arbitrary X, it will be suffi- 
cient to consider the case in which X is continuous at x = 0. It will turn 
out that in this case the existence of Exp X can be reduced to the existence 
of exp X for an arbitrary X. (incidentally, the existence of Exp X in the 
complementary case, where X = ps, is reduced to the existence of exp X for 
X = e., i.e., to the existence of jfj cf. the end of ^44). 

If X = Xj_ and X = X 2 are monotone functions satisfying (149), and if 
X^ * X 2 (x) is de ^ ined by placing [4.^ = X^» = X 2 in (48), it is clear that 

X = X^ * X^ is a monotone function satisfying (149) . It is also clear that 
(53) remains valid for y,^ = X^, y^ = X 2 » ^ X°(y) id defined by placing 
I* = X in (10). In particular, X 1 * X 2 (o» ) = X 1 (oo) X 2 (o»), 3 ince X°(0) = 

X(o«). And X(off) is the total variation of any X, since X(-^») =0. If X 

1 2 

is any monotone function satisfying (149), let X,X,...be defined by 
n n-1 1 

placing X = X * X and X = X. Then it is seen by complete induction 

. n n 

that (149) is satisfied by X = X , and that the total variation of X is 

the n-th power of X (<<**) = X°(0), since (X ) (y) is the n-th power of X°(y). 

Hence, if the trivial case X(o©) = 0, for which exp X and Exp X are taken 

care of by (157), is excluded, then X (x)/x n (o®) is a distribution and has 

the transform represented by the n-th power of X°(y)/X°(0), where n = 1,2,... 
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. v 0 

According to (15), all of this remains valid for n = 0, if X, is defined 

to be e for every X, Since the sum of the positive constants (nle) - ^, 

where n = 0,1,2,..., is 1, it follows that the first of the series 

n O* / 0 

T _i_ x W , y~ _i_ /X (y) 
n!e x”(o») n!e U 5 (o) 

n=0 n=0 ' 

is a distribution, and that the second series is the transform of this dis- 

W o 0 

tribution. But the sum of the second series is e /e, where w = X (y)/X (0). 
Hence, if X° is substituted from the case \i = X of (10), it follows that 
the distribution represented by the first series has a transform possessing 
a logarithm identical with the integral (150) . This proves that the assign- 
ment (150) defines a distribution exp X for every monotone X satisfying (149). 

Next, if h(u) is an abbreviation for u ^(e* U - 1 - iu), then 
h(u) = -§ + o(l) as u — 9 +0 and h(u) = 0(| u) ) as u-> + 00 , This ob- 
viously implies that, as n — * 

f -2 ixy . 

J x (e - 1 - ixy)dl — * 

l/n<|x|<oo 0<|x| <o° 

holds uniformly on every fixed bounded y-interval, if X = l(x) is any mono- 
tone function satisfying (149). But the integral on the right is identical 
with the integral (151), if X has no jump at x = 0, i.e.j. if (152) does not 
contribute to (151) . On the other hand, if X n (x) denotes a non -decreasing 
bounded function for which dX fl (x) = 0 or dX^/x) = dX(x) according as (xj< l/n 
or \x\ > l/n, then the integral on the left of the last formula line can 
be thought of as a complete integral (151) . It follows therefore from tne 
continuity theorem, formulated at the end of §10, that, in order to prove 
the existence of a distribution Exp X satisfying ^151), it is sufficient 



-2 . ixy 

x (e - 1 - ixy) d\ 
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to show the existence of the distributions Exp X , provided that \ is 

assumed to have no jump at x = 0. But it was seen after (153) that this 
assumption can be granted. Consequently, the existence of the distribution 
Exp X will be proved for every X, if it is proved for every X which is 
constant on some open interval containing the point x = 0. However, if 

there exists such an interval, then there exists a non-decreasing bounded 

_2 

function, say Xq , such that dl^(x) = x dX(x) for - c*< x <ej® . Hence, the 
integral (151) can be written in the form 

f ixy C _1 

J (e - l) <3 Xq(x) + icy, where c = - j x dX(x) 

(the integral defining the constant c is convergent, since X is constant 

near x = 0). It follows therefore from (150), (155) and (53), that the 

assignment (151) Is satisfied by Step X = °e * exp X fl , that is, by a dis- 

tribution which, according to (55), differs from the distribution exp Xq 

only in a translation. 



45. This proves that (150), (151) define distributions 
exp X, Exp X for any monotone X satisfying (149). It will now be shown 

that any semi-group of the type indicated at the beginning of ^42 is 
expressible in terms of these distributions and of (155) J distributions 
which, in turn, generate semi -groups for arbitrary choices of X and c in 
(150), (151) and (155) respectively. 

Let there be given for every value of a positive parameter t a 
distribution \i — [j, (x) on the line — O® < x in such a way that 
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(160) pt u+v = pi U * pi v , (0 < u <«• , 0 < y <o° ) 

and 

(l60* ) pi^— > s as t — ^ +0 j cf. (1$). 

Then the sheaf 0 < t < o 3 of distributions pi^ will be called a semi-group, 
and pi^ = its generator. The latter will simply be denoted by 

pi = pi(x). It turns out that the correspondence between these sheets of 
distributions pi^ and their generators pi^" is one— to— one. Let therefore a 
given distribution pi be called a generator if there exists a semi-group 
pi^ satisfying = pi. The object of the following consideaations is the 
determination of all those distributions pi which are generators. 

First, it is clear that, if X(x) is a monotone function having the 
properties (149), then tX(x) is a monotone function having the same proper- 
ties, where t denotes any positive number. Hence, exp (tX) and Exp (tX) 
represent distributions for every X and for every t. According to (158), 
both pi^ = exp (tX) and pi^ = Exp (tX) satisfy (160) for every fixed X. 
Moreover, it is clear from (150), (151), (28) and (56), that (160’) is 

satisfied in both cases. Similarly, it is obvious from (155) and (87) 

^ c*t 

that (160) and (l60 f ) are satisfied by pi = e, if c is arbitrarily fixed. 
Finally, it is seen from (56) and (52) that, if either of the requirements 
(160), (160 1 ) is satisfied by two sheaves of distributions, say by pi^ = pi^ 
and by pi^ = pig^, then it is satisfied by the sheaf pi^ = pi^ # pi ^ also. 
Consequently, if X’ and X" are arbitrary monotone functions satisfying 
(149), and if c is any real constant, then 

(161) pi = e * (exp X 1 ) * (Exp X") 

1 . t 

is a distribution representing the generator, pi , of a semi-group pi $ 
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0 < t <0». And what will be shown is the converse, namely the fact that 

a given distribution, |x, is representable in the form (161) whenever it is 
a generator, 



46 . If r is any fixed positive number, then (150) and (151) 

show that the contributions of the ranges \ x) < r and |x| > r to 

o o 

log (exp X*) (y) and log (Exp X") (y) are 



f 



(e** 7 - 1) dX' and 



-2 

x 



/ ixy 
(e 



- 1 - ixy) dX n 



\ xl <r \ x\ >r 

respectively. Consequently, these contributions can be written as 

iay + f x“ 2 (e 1 ^ Cy - 1 - ixy) dX« and iby + J (e** 7 - 1) dX" 

I x J & \ x | >r 

respectively, where a and b are real constants and X'^, X"q denote func- 
tions of the same type as the functions X*, X M , that is, non-decreasing 
bounded founctions (cf. the last formula line of § 44) . Thus it follows 
by the reasoning applied at the end of § 44 > that the contributions of 

the complementary ranges, |x| < r and Jx| > r, to exp X* and Exp X" can 

a b 

respectively be represented as s * Exp X* and s * exp X"q. It follows 
therefore from (52), (87) and (158) that, if a distribution |i can be 
represented in the form (l6l), then the number c and the non-decreasing 
bounded functions X’, X” occurring in (l6l) can be so chosen that X' and 
X" are constant on the interval | x \ < 1 and on both of the complementary 
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half -lines, | x | > 1, respectively. Thus it is clear from (150), (151) and 



(155), that the representation (l6l) is equivalent to 

1 

(162) log [x°(y) = icy + ( 



f - [ He 1 * 7 - 1) + 



-2, ixy 

x (e - 1 - ixy) dX, 

-1 



where c is a real constant and X. denotes a monotone function satisfying 



(149). 

Accordingly, what has been verified above is that, if there exists 
a pair (c,X) satisfying (162) for a given distribution [A, then ).i is a gener- 
ator. And the converse to be proved is equivalent to the statement that 
there exists for every given generator \l a pair (c,X) satisfying (162). 

Incidentally, the replacement of (l6l) by the normalized represen- 
tation (162) is actually such as to make the correspondence between gen- 
erators [A and the pairs (c,X) a one-to-one correspondence. In other words, 
if (c^,X-^) and ( c 2>^2^ ^ e ^ on 6 to the same |A, then c^ = C2 and X^ = X^. 

In order to prove this, let a = c i ~ c 2 a = ~ ^2 • ^ then 

follows from (162) by subtraction, that 

f 1 1 

(l63)0=iay + ( J - ^ )(e^ X ^ - 1) da + ^ x -2 (e^ X ^ - 1 - ixy) da. 

-1 -1 



Since X^ and X2 are monotone functions satisfying (149), the function a = a(x) 

is 

is of bounded variation,, continuous from the left, and vanishes as x ■ 



' - tyo 



Accordingly, the assertion is that (163), where J^l da| < 0° , cannot hold 
(for every y) unless a = 0 and a(x - 0) = const. 

For a fixed real number b, integrate the identity (163) between the 

and divide by 2. 

limits y = b - 1 and y = b + 1^ Then, if b is renamed y, the result appears 
in the form ^ ^ 

0 = i&y + ( ^ ^ )e ljCy (sin x/x - 1) da + j* x“ 2 (e ^sin x/x - 1 - ixy) da. 



-1 



-1 
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where sin x/x = (sin x)/x. If this is subtracted from (163), it follows 



that 



x -2 i 

0 = ( J* - j ) e 1X ^(l - sin x/x) da + f x e ^(l - sin x/x) da. 

- 1 -1 

This can be written in the form 0 = J' e^ X ^ dp, where 

r (l - sin x/x)da(x) if |x| > 1, 



dp(x) = ' 



x ^(1 - sin x/x) da(x) if |x| <1. 

Since (l - sin x/x) and x 2 (l - sin x/x) are regular at x = 0, and since 

J* / dee | < 00 , it is clear that J / dB j < «o . it' follows therefore from (19*) 

that p(x - 0) = const., since 0 = J dQ for every y. But p(x - 0) = 

const, and the preceding formula line obviously imply that a(x - 0) = const. 
And a = 0 now follows from (163). 



46! In the parallelism referred to at the beginning of §42, the 
relation (97) corresponds to (l60'). However, while it is possible to prove 
that (97) must hold if *jx n is assumed to be convergent, it is impossible to 
disprove the negation of (l60') if (l6o) is supposed to be satisfied. By 
this is meant that an admission of the continuum hypothesis leads to sheaves 
|^ t which satisfy (l6o) and violate (l60'). In fact, while every measurable, 
and every locally bounded (or half bounded) solution of the functional equa- 
tion 



(164) 



f(u + v) = f(u) + f(v) 



is known to be represented by the linear functions* f(t) = f(l)t, the con- 
tinuum hypothesis supplies non-measurable solutions f(t) of (164) . But if 
f(t) is any real-valued solution of (l64)> it is clear from (87) that (16c) 
is satisfied by (J,^ = ^ ^ej while (l60 1 ) will hold only if f(t) is so chosen 
as to satisfy f(t) = o(l) = 0(1) as t — ^ 0. This is the reason that both 
(160) and (160 1 ) have been required in the definition of a semi-group 
According to (53) and (160), 

(165) (|x u+ V(y) = (H U )°( y) . ( (1 T )°(y). 

This implies that (p D ^) (y) is the n-th power of (p^)°fy) f or n = 1,2,... 

and for every t. On the other hand, it is clear from (l60’) and from the 
uniformity result of /l0, that there exists for every p > 0 a sufficiently 
small q > 0 such that | (|i^) (y) \ > y whenever -p < y < p and 0 < t < q . 

V Jr 

Consequently, 

(166) I ([/k) (y) I >(i) if -p < y < p and 0 < t < nq^. 

Since p can be chosen arbitrarily large, it follows, by letting n — * 
when p is fixed, that 

(167) (|J.^) (y) ^ 0 for - &>< y < o° and 0 < t < 00 . 

According to £ 10, every transform is a continuous function of y. 
Hence, if a transform is distinct from 0 for every y, there exists for 
- o» < y <o° exactly one continuous function representing a logarithm of 
the transform for every y and attaining the value 0 for y = 0} cf . (12) . 
This, when compared with (167), defines 
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as a unique continuous function of y for - cx>< y <o° , if t is fixed. On 
the other hand, (165) means that the function (168) satisfies (164), if y 
is fixed. But (166) and (11) imply that the solution (l68) of (164) is 
bounded on every bounded t-interval, if y is fixed. Consequently, (168) 
is a linear function of t for everjr fixed y$ so that f (t) = f (l)t. If 

y y 

this is substituted into (168), it follows from (165) that 

(169) (y*) (y) = (y (y)) t , where ft = ft 1 ; 
so that, according to (167), 

(170) y°(y) ^ 0 for — 0°< y <0 0 . 

It is understood that the exponential, ( )^ = ^ \ on the right of 

(169) is defined in terms of the continuous logarithm specified by (168). 

Accordingly, (l60) and (l60* ) together are equivalent to (169). 

In fact, it is clear from (28) and (15) that (l60') is implied by (169). 

It also follows from (169) and (170 ) that 

(|i t )°(y) - 1 

(171) 7 >log n°(y) as t — »0 

X 

holds uniformly on every fixed bounded y-interval (in fact, both w = y* 3 (y) 
and 1/w are bounded, and therefore (w^ - l)/t — > log w holds uniformly, 
on every such y-interval). 



47. These results, proved for an arbitrary semi-group y^, will now 
be used in order to prove that every given generator y = y^ is representable 
in the form (161). 
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In terms of the semi-group which is generated by a given pi = pi^, 
define, for every fixed t > 0 and for -©*< x < 0 ° , two monotone functions, 
say X'.j. = X’^(x) and = X"^.(x), by placing 

"o if \x( £1, 
t^dpi^x) if \x | > 1 



(172) 



<*X' t (x) = 



and 



(173) 



dX" + (x) = *, 



ft-Vd^(x) if |x) < 1, 






0 if \ x \ > 1 

(where d = d^.) . Since pi^, where t is fixed, is a distribution, it can be 
assumed that (149) is satisfied by both X = X'^. and X = X"^. 

According to (172) and (173), the logarithms, (150) and (151), 

of the transforms of the distributions exp X* and Exp X n ^ are respectively 

t 

represented by 



J( J- ( 



)(e* X ^ r - l)dpi^(x) and t ^ ^ 



, ixy t 

(e - ixy - 1) dpi ( x ). 



Hence, if c, is an abbreviation for 

X 1 

(173) c t = t_1 J xfyV), 

the sum of these two logarithms is identical with 

i C ixy + 
t -1 J (e - 1) dpi (x) - ic t y, 

and so, in view of (10), (12) and (155) » with 

c t 

CV)°(y) -t" 1 - log( e)°(y) . 

Accordingly, the sum of the logarithms of the transforms of the three 
c + 

distributions ^e, exp X'., Exp X" is identical with the quotient on the 

“ t 

left of (171). Since the relation (171) holds uniformly on every fixed 
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bounded y-interval, it follows that the same is true of the relation 

C t Q Q 

(174) ( e) (y) •(expl’.j.) (y) . (Exp 1"^) (y) (j, (y) as t — ^ 0; 

a relation which, according to (82) and (28), is equivalent to 

Cx 

(175) ( e) * (exp \' t ) * (Exp X" t ) — * as t — *0. 

While (175) indicates, it does not in itself imply, the exist- 

the 

ence of a representation (l6l) . However, ^transition from (175) to (l6l) 
can now be based onthe estimates (21), (22). 



48. First , (22) and (21) show that, if t and r are arbitrarily 
fixed, p r 

( ^ ** J ) ^(x) and y x 2 d^(x) 

-r -r 

are respectively majorized by the maximum of4jl - (|A^)°(y) j on the inter- 
val -1/r < y ^ 1/r and by the minimum of 3 )l - (lA t )°(y) |/y 2 on the same 
interval. If r > 1 in the first case and r = 1 in the second case, these 
estimates can, by (172) and (173) » be written as 

r r 

(176) ( ) - f ) dX» (x) < t" 1 Max | 1 - (^ t )°(y)f , ( r > 1) 

_ r -l/r<y<l/r 

and 

1 

(177) I = j x 2 dX n (x) < 3t 1 Min |l - (|/)°(y)|/y 2 

3 \ = -i<y < i 

w — 
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respectively. But (171) holds uniformly on every fixed interval 

-l/r < y £ l/r, and (170) shows that the function on the right of (171) 
is a continuous function which, according to (12), vanishes at y = 0. 

It follows therefore from (176) and (171) that the total variation of 
X'-^x) on the exterior of the interval -r < x < r tends to 0 as t — ^0 
and r — > oo . It is similarly seen from (171) and from (177), or rather 
from the proof of (177), that J* x^dX n ^(x) = 0(1) as t— ^0. Actually, 
this 0-estimate is equivalent to | dX"t (x) = 0(1) as t — * 0, since, ac- 
cording to (173), the functions X = are constant for - Q> < x ^ -1 
and for 1 < x <o° . 

Since both X = X'^ and X = X"^ are monotone functions satisfying 
(149) for every t, it follows that, as t— > 0, the functions X 1 ^ and X"^. 

are uniformly bounded for -o»< x <<oo . Consequently, Helly's selection 
theorem is applicable. It supplies the existence of a sequence t^ , t£ , • ♦ • 

and of two monotone functions, say X' and X n , such that t — ?0 as n — , 

n 

both X = X' and X = X M satisfy (149) and, if X' Q , X" n respectively denote 
the functions X’^., X"^ belonging to t = t n , the functions X' n , X" n tend 
to X’, X” at every continuity point of X' = X*(x), X"(x) respectively. 

In addition, the total variation of the monotone function X' n 
on the exterior of the interval -r <, x 1 r tends to 0 uniformly for all n, 
as r — ► oo . Since X’ n tends to X 1 at every continuity point of X* = X*(x), 
this implies that, as n — ? , the integral (150) belonging to X = X* r 

tends to the integral (150) belonging to X = X’j so that 

(exp X' rj )°(y) — * (exp X»)°(y). 



( 178 ) 



On the other hand, the monotone functions X" are uniformly bounded for 

_ o*< x <o» and are independent of x for - o®< x < -1 and for 1 < x <©° . 
Since X*' tends to X" at every continuity point of X” = X”(x), this implies 
that, as n — > the integral (151) belonging to X = X" n tends to the 
integral (151) belonging to X = X" j so that 

(179) (Exp X” n )°(y) — ^ (Exp X")°(y). 

According to (150), (151) and (170), the limit functions occurring 
in (178), (179) and (174) are distinct from 0 for every y. It follows 
therefore from (178), (179) and (174) that, if t = t n , the first factor on 
the left of (174) must tend to a limit as t = t Q — ^0, i.e., as n — . 
Since this holds for every y, it follows from (155) that there exists a 
constant c such that, if c Q denotes the value c^ belonging to t = t n , then 
c n i»e«, 

c n 

(180) e (y)— *■ e(y) . 

In view of (56), (53) and (28), the relations (180), (178) imply 
that c» 

n q 

( e) * (exp X * n ) * (Exp X" n ) -* ( e) * (exp X') * (Exp X»). 

If this is compared with (175) » where t = t^, it follows that (I can be 
represented in the form (161) . 

Incidentally, it is seen from the proof that, in the analogy men- 
tioned at the begir&ng of J> 42, the three members on the right of (161) 
are paralleled by the three series (111) . 



* 



no 

Inversions 



49. In view of (19), there arises the question as to the existence 
of an explicit inversion formula for (10) j that is, of a formula represent- 
ing an arbitrary distribution pi = p(x) in terms of its transform pi° =5 pi°(y) . 

In order to obtain such a formula, it will first be verified that 

(181) 1 - 2 ( si (rt - rx) dpi(t) — * pi(x + 0) + pi(x - 0) as r — > 

op u 

where ^ and si(u) = J v ^sin v dv. 

-O 9 o 

Since pi(o° ) = 1, pi(- 0° ) = 0 and si(+o») =±|#, a partial inte- 
gration shows that 

(182) j"si(rt) dpi(t) = %tc - j' pi(t)t ^sin rt dt. 

Since pi is monotone and bounded, it is clear from the second mean-value 

theorem that, for every fixed a > 0, the contribution of both half-lines 

-®°< t < -a, a < t <' c>0 to the integral on the right of (182) tends to 0 

as r *■ o’ . On the other hand, since ii(t) = pi(+0) + o(l) as t— »+0, 

the contribution of the interval -a < t < a is 

a 0 a 

pi(+0) J" t ^sin rt dt + pi(-0) [ t ^sin rt dt + J*o(l)t ^ sin rt dt. 



U 



-a 



-1 



which, since si(u) = J" v sin v dv, can be written in the form 

® ar 

r _i 

pi(+0)si(ar) - pi(-0) si(-ar) + o(l) t sin t dt. 

J 

-ar 

Since | t ^sin tj < 1 and si(+ <x> ) = it is clear that a can so be 

chosen as a function of r that, as r — £ o<» , the function a tends to 0, 



the product ra to co , finally the expression in the last formula line 
to ^Jtpi(+0) + ir)t pi(— 0) • 



Ill 



If all of this is substituted into the integral on the right of 
(182), the identity (182) proves the particular case x = 0 of (181) . But 
the case of an arbitrary x can be reduced to the case x = 0 by shifting 
to t = x the origin of the t-axis on the left of (181) . Hence, (181) is 
true for every x. 

There is a similar relation for the case where the sum on the 
right of (181) is replaced by the corresponding difference, i.e., by the 
(non-negative) jump of the distribution |X at a point xj in fact, 
r sin (rt - rx) 



(183) 



rt - rx 



dji(t) ^x + 0) - fi,(x - 0) as r 






In contrast to the proof of (181), which depends on the second 



mean-value theorem, the proof of (183) requires only the first mean-value 
theorem. In fact, since j"jd[.i(t) | < 0° and Jsinf < 1, this theorem implies 
that a 

(184) r -1 JV : ^sin(rt) dji(t) = r 1 j" t sin(rt) d^(t) + o(l) as r— 

-a 

holds for every fixed a > 0. On the other hand, since (J,(t) — >[a(+ 0) a 

t — *+0 and z ^ sin z — * 1 as z — ^0, it is clear that a can so be chosen 
as a function of r that the expression preceding the o(l) on the right of 
(184) tends to |a(+ 0) - |x(-0) as r — » o°. It follows therefore from (184) 
that (183) is true in the case x = 0. Finally, the case of an arbitrary 
x can be reduced to the case x = 0 by shifting to t = x the origin of the 
t-axis on the left of (183). 



50. As a corollary of (181), it will now be verified that an 



explicit inversion of the transformation (10) of a distribution [X into its 



transform (X is given by 

|x(x + 0) + |x(x - 0) 1 

(185) = const. + 2# If® 

2 r-#o<> 



1 - e 



-ixy 



iy 



lAy) dy 



v/ 

-r 



for every x (the numerical value of the constant follows by placing x = 0). 
Correspondingly, (l83) will lead to an explicit representation of the (non- 
negative) jump |x(x + 0) - |x(x - 0) in terms of the transform, namely to 



(186) fx(x + 0) - fx(x - 0) = lira 



r-j»oo 



2r 



e 1X7 |x°(y) dy. 



-r 



Since (1 - e Z )/z — ?l as z — -^O, the integrand on the right of 



(185) remains continuous at y = 0, and so the existence of the integral on 
the right of (185) is obvious, if r <0® . On the other hand, the existence 
of the limit on the right of (185) is part of the statement. And this limit 
cannot in general be replaced by the Lebesgue integral, J”* , over the line 
- o»< y <o° . 3n fact, it cannot in general be replaced by the ordinary im- 
proper integral ^ j that is, by the limit of the contribution of the range 
-p 5 y 1 r as r and p tend to ©3 independently, rather than so as to be 
subject, as in the "principal" integral (185), to the restriction r = p. 

In order to prove the necessity of this restriction, it is suffi- 
cient to show that, if the contribution of the range -r < x £ 0 is omitted 
in (185) , the resulting integral over the range 0 < x < r need not have a 
limit as r - — ^ 00 . But, if x = 1 and |X°(y) = cos y, then the imaginary 

part of the integral just mentioned is 



112 bis 



r 



f (1 



r 



as r 



0 



(1 - cos y)y ■ L cos y dy = Const. + o(l) - j y -1 cos 2 y dy-^ - o° 

1 

— ? o° . Hence it is sufficient to ascertain the existence of a 



y cos y dy — 



distribution ^x(x) for which (. 1 ° ( y ) becomes cos y. But (10) shows that 



such a |.i (x) is represented by the step-function which is 0, \ or 1 accord- 
ing as -o®< x < -1, -1 < x < 1 or 1 < i < . 

On the other hand, the limiting average on the right of (186) always 
exists, and has the value represented by the left of (186), even if the 
range -r < y ^ r, its length 2r and the limit proces 2r — > o=> are replaced 

by a range u < y < v, by its length v - u and by the limit process v - u — ^ 

respectively (so that not only u = 0, v-*** 3 , or u — v = 0 but, if 
v - u — » co , even u — ? c« , v — =* oo or u — - o=> , v — t 

are allowed). However, this generalization of (186), which is proved in 
exactly the same way as (186) itself, will not be needed. 



51. The symmetry restriction in (185) will be used by writing the 
integral (185) in the form 



r 



r 




-r 0 



Replace x in (10) by t and substitute the resulting representations of |x° (y) 
and |x° ( — y ) into the last formula. This gives 
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| . J ii-fjfe 1 * - e- ity )a^(t) - J(e 1(t - X)y - e i( ^ t)y ) *(t )J dy, 



-r u 
that is. 



) J dy. 



j = 2 J | y 1 J" sin(ty) djx(t) - y _1 f sin(tx - ty) d(.i(t) 

_ r 0 ** 

Since (sin z)/z remains continuous at z = 0, and since r < oo and 

JW(t) I < oo , the order of integrations can be interchanged here. Thus 
r r r 

J = 2 J I J 7 1 sin ( t y) dy - J y 1 sin(tx - ty) dy j dp,(t) 

-r 1 0 0 ' 

, r -1 

or, since si(u) = v sin v dv. 



r 

J = 2 J"si(rt) d(j,(t) - J' si(rt - rx) dfi(t). 



-r 



Since this is identical with the integral on the right of (185), the asser- 
tion of (185) follows from (181) and from the case x = 0 of (181). 

Similarly, if x in (10) is replaced by t, the integral on the right of 
(186) appears in the form 

J = J { Je ltJ *Ct) ♦ e 1Xy J *<*>) dy, 

-r 0 

which can be rearranged into 
r r 



HfK 

-r o 



(t-x)y i(x-t)y 



+ e 



r 

) dyj d|*(t) id 2 cos (t - x)y dyj d|A(t). 



1 14 



Since the interior integral, I 1 , is 2r times the integral on the left 
of (183), it follows that (186) is identical with (183). 



52. For a given distribution p, let p, as before (17), denote the 

-1 

distribution given by ^p(x) = 1 - p(--x) at all of its continuity points. 

Then an x is in the point spectrum of p if and only if -x is in the point 

-1 

spectrum of [A . It follows therefore from (72), (68) and from the definition 
of a vectorial sum, that the point spectrum of p is vacuous if and only if 
x = 0 is not in the point spectrum of ^p * [A. Hence, if [A and x in (186) 
are replaced by p * p and x = 0 respectively, it is seen from (54) that 
the condition 

r 2 

(187) ^ | p°(y) ) dy = o(r) as r — ? o« 

0 

is necessary and sufficient in order that the distribution p be continuous 

in the sense of ^4. It follows that 
r 

(188) ^ 1 p°(y)\ dy = o(r) if and only if p is continuous. 

0 

In fact, it is clear from (11) that the o-condition of (188) implies the 

o-condition of (187) j while the converse inference is clear from Schwarz’s 
inequality. 

A corollary of (188) is that 
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o 

(189) if lira [j. (y) = 0 , then \i is continuous, 
y-r + 'o® 

But it will be shown in § 6 3 that 

(190) the converse of (189) is false. 

On the other hand, a weakened form (189) has a converse, represented by 



the Riemann-Lebesgue lemma, according to which 



(191) if JJ. is absolutely continuous, then lira (y) = o. 

y _*4yoo 

But it will be shown in ^ 64 that 

(192) the converse of (191) is false. 

As to (191), it is clear from (10) that, if (i is absolutely con- 
tinuous and has a density |i'(x) which is constant on an interval and van- 
ishes outside this interval, then (i°(y) = 0(\y\ "**) = o(l) as y — 9 + o® . 
Hence, the same is true if y,'(x) is a step-function, having a finite num- 
ber of intervals of constancy and vanishing outside a bounded interval, 

say -r < x £ r. If is any absolutely continuous function, and if ju° (y) 

^ — * r 

is defined so as in 6 10, then 

* r r 

M°(y) r = C e 1X7 |Ai(x) dx; hence) |Ay) r - J e 1 * 7 f(x) dx £ j | (x) - f(x)| 



dx, 



-r 



-r 



-r 



where f is any L-integrable function on the interval -r < x < r. Since u' (x) 
is L-integrable, there exists for every s > 0 a step-function f = f having 
a finite number of intervals of constancy and such that the integral on 
the right of the inequality of the last formula line becomes less than e. 
Finally, it is clear from £ 10 that there exists for every e > 0 an r = r g 
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such that I p (y) - p (y) I < s for every y. This, when combined with the 
1 r 

inequality in the last formula line, gives. 

r 

| p (y)l < s + e + | ^ f (x) dx | for every y. 

-r 

-1 

Since the last integral was seen to be 0()y| ) = o(l) as y_^+c*>> the 

proof of (191) is complete. 

There is no explicit criterion necessary and sufficient 

transforms p°(y) belonging to absolutely continuous distributions p(x). 

On tlie other hand, it is seen from (10) and from Plancherel’s completeness 

theorem, that a transform does or does not belong to an absolutely con- 

2 

tinuous distribution p(x) having a density u T (x) which is of class (L ) on 
the line -*>°< x <o® according as p (y) is or is not of class (L ) on the 
line - P° < y < o° . 

In particular, the existence of an e > 0 satisfying p°(y) = 

_i_ e 

0 ( | y | 2 ) as y — » + o=> is sufficient in order that p be absolutely con- 

this 

tinuous. It can be shown that A is false for every e < 0. 



53. Thus it is of interest that 
(193) j| p°(y)| dy < ©o 

turns out to be sufficient for the absolute continuity of p. In fact, (193) 
is sufficient in order that p(x) should possess a density u'(x) which is 
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continuous for - o»< x < o° and tends to 0 as x — > + oo . 

In order to see this, it is sufficient to observe that, if (193) 
is satisfied, then integral (185) and the integral j jJ.°(y) dy, 

which results by differentiation of (185) beneath the integral sign, are 
uniformly convergent for -°°< x <- 0 ° and represent therefore continuous 
functions of x. Hence it is clear from (185) that, if (193) is satisfied, 

(10) can be written in the form 

(194) p.°(y) = Je ix y {x’(x) dx, 
where 

(195) H’(x) = Je 1PC P°(y) dy. 

Finally, a repetition of the proof of (191) shows that (193) and (195) 
imply that (,i 1 (x) — 3> 0 as x — > + 00 . 

It may be mentioned that, since [x(x) is supposed to be a monotone 
function satisfying (8), the conditions under which the "Fourier inversion", 

(195), of (194) was just established are symmetric in x and y; in fact, (193) 
is paralleled by J'(u'(x)|dx < o° , since ^^’(x^dx = ^d(.i(x) = 1. Further- 
more, it was shown in 10 that (J.°(y) is always continuous. Finally, 

M°(y) 0 as y— *•+«*, by (191). 

If the assumption (193) is replaced by 

(196) J | y\ lH°(y)| dy <00 , 

where k is a non-negative integer, then the integrals which result by m-fold 
formal differentiation of (195) are uniformly convergent for - oe>< x <<oo , 
if m < k. Consequently, (196) is sufficient in order that |J.(x) should have 
a continuous density possessing continuous derivatives of any order not 
exceeding k. Since all these derivatives can be obtained by differentiating 
(195) beneath the integral sign, it follows by a repetition of the proof of 



(191), that all these derivatives tend to 0 as x — > + oo . 

The existence of some q > 0 satisfying 

(197) |Ay) = 0(/ y r k " 1_q ) asy-j+oo 

is sufficient for (196). In particular, the existence of some p > 0 
satisfying 

(19?) H°(y) = 0(exp(-!yP) ) as v — + ± oo 

is sufficient in order that (196) be satisfied for every k; so that jr 1 (x) 
then has derivatives of arbitrarily high order for - oo < y < oo , gut the 
formal Taylor series of u'(x) at x = 0, i.e., the power series 

» CO 

(199) X ^( n+1 ) ^o) x n /n! = (2«) _1 X^(-l) f y” U°(y) dy x^/nl, 

n=0 r n =0 J 

can then be divergent for every x ^ 0. In fact, it will be seen in ^54 
that there exists for every;- p < 1 a distribution satisfying (198) but 
rendering the series (199) divergent for every x ^ 0. 

However, if (198) is satisfied for p = 1, and, more generally, if 
there exists a positive q satisfying 

(200) H°(y) = 0(exp(-q |y! ) ) as y— * + oo, 

then i.i.*(x) is regular-analytic for _ oo < x < oo . i n fact, there must then 
exist in the strip Ivl < q of the w-plane, where w = x + iv, a regular- 
analytic function which is identical with ' (x) on the real axis. In addi- 
tion, this regular-analytic continuation must then be bounded on every 
strip J v I < r, if r < q. All of this follows by observing that the inte- 
gral which results by replacing x by w = x + iv in (195) is uniformly con- 
vergent, and represents a bounded function, on the strip Ivl < r, if r is 
less than the constant q for which (200) is supposed. 
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If (198) is satisfied by some p > 1, then (200) is satisfied 
by every a, and so ji* (x) must then possess an analytic continuation which 
is an entire function. In addition, the latter must then be bounded on 
every fixed strip parallel to the real axis. Finally, the order of this 
entire function cannot exceed p/(p - 1). This is readily verified from 
Stirling's formula, since (198), where by [assumption, p > lj lmplies that 
the coefficient of x° in (199) is majorized by a constant multiple of 



o° 



( 201 ) 



J / exp(-y P ) dy = p P ( [n + lj /p)/ni. 



54. As an illustration, consider the distribution u = u, for 

P 



which the transform is exactly 

( 202 ) 



^p°(y) = exp( - \ y| P ), 



provided that ja exists. This proviso is necessary, since the function of 
x which results if (202) is substituted into (195) can attain negative 
values and, therefore, fail to be a density. 

Actually, the distribution u cannot exist if p > 2. In fact, 

Jr 

° P 

(202) implies that (y) - l<~w - |y( as y — ^ 0. Hence, the o-condition of 

P 

(20) is satisfied whenver p > 2. Consequently, (20) states that, if p, 

r 

Of \ 

exists for a p > 2, then = e. But then ^ (y) = 1 for 
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all y, by (15). And this contradicts (202), the limiting case p = 0 being 
excluded by p > 2. 

It will from now on be assumed that 0 < p < 2. The existence of the 



distribution ju.p for every such p will be proved in § 56. Let this be granted 
for the present. 

Since (202) is an even function, the coefficient of (2A,) - ^ x 11 in (199) 
is 0 or 2i n times the integral (201) according as n is odd or even. Accord- 
ingly, the Taylor series at x = 0 for the density of [ip is 

y (n+l) n, , _ _1_ T , lN n P( O + jj /p) 2 n 

(203) -Z- [hj (°) x / n * ~ «p Z- (-1) P(o n + 1 'l x 

n=0 P n=0 I ( 2n + !) 

According to Stirling's formula, the radius of convergence of (203) is 0 

or oo according as p < 1 or p > 1. This agrees with the end of S 53 . 

In the limiting case p = 1, the radius of convergence of (203) is 
neither 0 nor oo and the density is seen to become 

(204) 

(for all x) . This rational function becomes singular at the points +i. 



points having the distance 1 from the origin. Since (200) is now satisfied 
by q = 1, it follows that the conclusions drawn from (200) in § 53 cannot 
in general be improved. 

If p = 2, then, since P (n + ■§•) = # 2 2 2n P (2n + l)/nl, 

(205) (J-2*( x ) = exp(- ||x| 2 ). 

If the case p = 2 of (202) is compared with (154) , it is seen from 

(10) and (17) that (205) is equivalent to 
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(20*5) ^ ©’(x) = 2 ex P (“J x2 )> where e°( y ) = exp (-ly 2 ). 



( 206 ) 



Since substitution of (202) into (195 ) shows that 

H-’pCx) = # cos (xt) dt, 

0 



the density [4, 1 (x) is an even function for every p. It can therefore be 

assumed that x_> 0. 

A partial integration of (206) gives 
o° 

r _ t P 

(4,p* (x) = tc p j e t P S(xt) dt, where S(z) = (sin z)/z. 

0 

. 1/p 

Hence, if t is replaced by (t/x) for a fixed x > 0, it follows that 
(207) 



fJ. '(x) = it 1 a (l/x P )/x P 1 , 
P P 



t P V 
(x > 0), 



where a (z) is an abbreviation for 

P ^ 



(208) 



a p (z) 



Je' 



zt l/p 

sin (t ) dt. 



If p < 1, the Laplace transform (208) is convergent at z = 0 and 

has there the value ctp(0) = P (p) sin 0|#p)j so that, since the integral 

analogue of Abel s continuity theorem ensures that a (+0) = a (0), there 

P P 

results the limit relation a (e) — * P (p) sin (y#p) as e — >+0. Actually, 

instance 

it is easily verified from (208), for* by residue calculus, that (in accord- 
ance with Heaviside's ideas) this limit relation remains valid for the case 
p > 1, i.e., for the case in which the integral a (0) is divergent. Con- 

Jr 

sequently, from (207), 

(209) •(*)-* tf" 1 P (p) sin (|*p) as x — * 0»» 

r' 
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If p < 2, the constant on the right of (209) is positive, 

and so (209) implies that ^ ( x \ P dp, (x) <°°ifO<p-e<p hut 
f P P 

J \ x| dPp(x) = <>» (if p = 2, then (204) shows that the situation is 
quite different). It follows, in particular, that the second moment, 

f x 2 dp p (x), is for every p < 2. This is implied by the easier 
results of § 12 also, since, if p < 2, it is obvious from (202) that 
there cannot exist two constants, Cq (=1) and c^, satisfying (34) for 
n = 1, 



55. In order to discuss the analytic character of , it can 

be assumed that p < 1, since p^' is the rational function (204) of the 

transcendental entire function (203) according as p = 1 or p > 1, 

If p < 1, then, while p, * (x) has derivatives of arbitrarily 

P 

high order for -oo < x <<*» , the series (203) diverges for every x ^ 0. 

More than this will now become understandable from (207). In fact, it 

will be shown that <*p(z) is a transcendental entire function, if p < 1. 

Needless to say, the Laplace expansion (208) of this entire function is 

divergent on the left of the imaginary axis. But it turns out that, if 

p < 1, the function defined by (208) on the right of the imaginary axis 

possesses the analytic continuation 

co 

(210) ZI a J' ^(0)z /n* = p 2 (-1) P ( fn + lj p) sin fn + l) p) z n /nl, 

n=o * n =0 , J 



123 



If p = |r, then (210^ can be contracted, corresponding to (205), 



into 



o» 



(2U) = (J*)V (ia) - ZT(-l) , ‘r.U 2n+1 /(2n + 1)1 . 

n=0 

In order to prove (210), where p < 1, let q and r be positive 
constants and let 

f i _ p 

j f(z) dz, where f(z) = z 1 ^ Z TZ , (z b > o for z > 0) 

G 

be applied to the contour C consisting of the two segments and two cir- 
cular arcs 

i 0 

z = t, z = itj z = ee , z = Re , 

where 0<e<t<R<o» and 0 < <fi < §#. Since q > 0, r > 0 and 0 < p < 1, 
it is readily ascertained that the contribution of the two circular ares 
to the contour integral tends to 0 as s — 0 and R — . Hence, the 
integrals of f(t) and of if (it) over the half -line 0 < t <o° have a com- 
mon value. Thus, by definition of f(z), 

0 \ f -rt^.l-l it f«qi f -t-rt q cos^j<p q-1 -irt%in^*p 
(212; je t e dt = e Je t e dt. 



Since p > 0, the n-th derivative of the regular-analytic function 

represented by (208) in the half-plane to the right of the im aginary axis 

can be obtained by differentiating (208) beneath the integral sign. Thus, 

if z = r > 0, o° 

(n) 

“p « = (-D 



oo 



( n ) / x . . a ^ -rt^n ^ # l/p ^ n -rt^ (n+l)p-l 



sin(t )dt = (-1) p \e t 






sin t dt. 



if t is replaced by t . Hence, by the imaginary part of (212), 
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a p (n ^ (r) = (-l) D p 



c*= 

-t-rt cosJ*p (n+l)p-l 

e t sin(J*£n+l]p - rt P sin J#p) dt. 



where (n + l)p = q and 0 < r < . Since 0 < p < 1, it follows that 



06 



(213) |a (n \r)| < pje V^^dt = p P ( fn+l] p) 



for 0 < r <0° , 



(»), 



and that o ' '(r) therefore tends, as r — 9+0, to the limit 

CO 

n f -t (n+l)p-l _ 

(-1) p J e t sin Oi# [n + 1J p) dt. 



(a) 



In view of Rolle’s theorem, this implies that the derivative a (0), 

P 



when thought of as a right-hand derivative, exists and, has the value 
represented by the last formula,, line, i.e., by the value formally 

assigned by the assertion (210) . 

Finally, since (208) represents a regular-analytic function in 
the half-plane to the right of the imaginary axis, a^(z) can be developed 
in the vicinity of every z = r > 0 into a power series, 

< 7 ° 

a (z) = ct^(r)(z - r) /nl, 

F n=0 v 

having a radius of convergence not less than r. But (213) shows that this 



power series is dominated by one which, in view of Stirling's formula, is 

convergent in the whole (z - r) -plane, since 0 < p < 1. Consequently, 

a (z) is an entire function. Hence, (210) represents a (z) for every z. 

P p 

If p > 1, then (210) diverges for every x ^ 0, as (203) 
does in the case p < 1. If p = 1, then (210) converges in the circle 
| z f < 1 to the function l/(l + z ); so that (207) becomes the standard 

functional equation of the rational function (204). Finally, (203) and 

(210) are transcendental entire functions of the respective orders 

u - p- 1 )- 1 , (l - p)- 1 in the complementary cases p > 1, p < 1. 






If p > 1, then (210) diverges for every x ^ 0, as (203) does in the 
case p < 1. If p = 1, then (210) converges in the circle Izl < 1 to the 
function l/(l + z^)j so that (207) becomes the standard functional equation 
of the rational function (204). Finally, (203) and (210) are transcendental 
entire functions of the respective orders (1 - p“l) 1 , (l - p)“l in the 
complementary cases p > 1, p < 1. 



56. What remains to be shown is the existence of a distribution |Xp 
satisfying (202), where 0 < p <.2. 

It turns out that, in order to prove the existence of (4^ for a given 

p, it is sufficient to ascertain the existence of some distribution X = Xp 

satisfying the relation 
o 

(214) X p (y) s 1 - y P + o(| y| p ) as y-f 0 

(this relation is satisfied by (202), if Xp = Up is known to exist, but it 
can easier be ascertained for some Xp than in the explicit case required) . 
In fact, since (214) is equivalent to log Xp°(y) = - y P + o(| yj p ) , it 
implies that, as n — ■> , the function n log X °(n“^/Py) of y tends to 

P 

— | y J uniformly on every fixed bounded y-interval. Hence, if the existence 

of a distribution Xp satisfying (214) is known, it follows, in exactly the 

c o. -l/p 

same say as at the end of y 44> that the n-th power of X (n y) is the 

Jr* 

transform of a distribution, and that the latter tends, as n — ^ o => , 
to a distribution \x satisfying (202) . 
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In accordance with the general remark made at the beginning of § 13, 
the satisfaction or violation of (214) by a given distribution X. = X(x) 
depends, in the main, on the behavior of X(x) as x — > + <» . It turns out 
that the behavior necessary for (214) is typified by the distribution X(x) 
having the density 

(215) y (x) = C p /(l + (Cp = const.), 

if either 0<p<lorl<p<2. The remaining cases, p = 1 and p = 2, 
are taken care of by (204) and (205). It is understood that the numerical 
value of the constant Cp is determined by the condition I dX. (x) = 1. 

According to (215) and (10), the transform is 



oo 



(216) 



K (y) = 2C 



P 



j (1 t *2)-^ 



cos (xy) dx, 



and can therefore be expressed in terms of the cylinder function 

OO n 

(216 bis) (fz)% (z) = tC* P(q + 2) f (1 + t 2 ) q cos (zt) dt; q > 0. 

n J 



If x is replaced by xy for a fixed y 7^ 0, and if it is assumed, with- 
out loss of generality, that lyl = y, it is seen that the ratio of 1 - X^ (y) 
to y^ is a positive constant multiple of 



00 



v J 



_lp_l 

(1 + xVy 2 ) ~ * sin 2 Jx dx. 



Hence, in order to show that (214) becomes satisfied upon a suitable choice 
of the unit of length on the y-axis (or, what according to (17), is the 
same thing, upon the reciorocal choice c-f the unit of length on the x-axis), 
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it is sufficient to verify that the expression in the last formula line 
tends to a positive limit as y — >+0. But this becomes clear in both 

cases, 0 < p < 1 and 1 < p < 2, if the integration range 0 <_ x < oo is 
split at x = 1. 

This completes the proof for the existence of the distributions 
[ip implicitly defined by (202), where 0 < p < 2. 



56 bis. It is by no means obvious that all those distributions [ip 
exist for which the condition p < 2, imposed bjr the trivial restriction (20) , 
is not violated (cf. the beginning of §54). In order to see this, suppose 
that (202) is replaced by the assignment 

(202 bis) |i °(y) = Max (0, 1 - lyl"), 

P 

where p > 0. Then, since the expression on the right of (202 bis) is 
1 - lyl P or 0 according as |y( <_1 or I y I _> 1, the restriction (20) imposes 
again the condition p < 2 only. But it turns out that even the range 1 < p 
< 2 must now be excluded, since otherwise the distribution [ip assigned by 
(202 bis) does not exist. 

Suppose that this distribution exists for a given p. Then, since 
(202 bis) vanishes for large !y|, it follows from the criterion found at 
the beginning of § 53» that [ip has a density represented by (195). But 
substitution of (202 bis) into (195) gives 



(206 bis) 



Hp'(x) = if 



-1 



(l - t P ) cos (xt) dt. 



It follows therefore by partial integration that 

u ' (x) = a pa (x)/x if x > 0, 
P P 



(207 bis) 
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where a. (x) is an abbreviation for 

ir 



(20S bis) 



otp(x) = J t P ^ sin (xt) dt. 



*bis 



Since (206 bis) is an even continuous function, (207 A ) shows that (206 bis) 

is non-negative for - °o < x < «> if and only if (208 bis) is non-negative 

for 0 < x < o© . But (206 bis) is the density of a distribution. Accord- 
ingly, the distribution u assigned by (202 bis) exists if and only if p 

IT 

is such as to make the function (208 bis) non-negative for every x > 0* 

And it turns out that this is the case if and only if p < 1. 

In fact, (208 bis) can be written in the form 

x 



(209 bis) 



c V x) = j 



..p-i 



sin t dt, 



wnere x > 0. 



0 



If n = 1,2,..., the absolute value of the contribution of the interval 
(n - l)n < t <_ n« to the integral (209) is an increasing or a decreasing 
function of n according as the function t p_1 of t is increasing or decreas- 
ing on the half-line 0 < t <o© , i.e., according asp>lorO<p<l, 
Moreover, since (-1)° ^ sin t is positive in the interior of the n-th inter- 
val, the n-th contribution has the same sign as (-l) n h and is therefore 
positive for the first value of n. Consequently, the function (209 bis) 
becomes negative for certain values of x > 0 or is positive for every x > 0 
according as p > 1 or 0 < p < 1. Since (&D9 bis) becomes the non-negative 
function 1 - cos x in the limiting case p = 1, the proof is complete. 

It is also seen that the case p = 1 represents the elementary distri- 
bution denoted by p, 1 at the end of $ 7. If 0 < p < 1, then (209 bis) shows 
that x p a^(x) — » P (p) sin (|ucp) as x — ^ oo , which means, by (207 bis), that 
(209) becomes correct if P( p) on the right of (209) is replaced by p P(p) 

- r*(p + i). 
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Examples 



57. For every r > 0, let 0 r = 0 r (x) denote the distributions 

represented by the step-function which is 0, \ or 1 according as -** < x < -r, 
-r<x<rorr<x<o* . Thus 0 r °(h) = cos (ry), by (10). Hence, if 

r-^r^, . . . is a sequence of positive numbers, the criterion of (95) shows 
that the infinite convolution *S r is convergent if and only if the product 
"1 [cos (r n y) is uniformly convergent on every fixed bounded y-intervalj in 
which case the transform of the distribution 



(217) 



fA = *R 



L n 



is 



(218) |i°(y) = IT cos (r n y). 

Since cos t = 1 - ^t^ + o(t^) as t — ?0, it is clear that TT cos(r n y) is 
uniformly convergent on every fixed y-interval if and only if the positive 
constants are so chosen that 

(219) 51 r n 2< °°’ 

It will always be assumed that (219) is satisfied, i.e., that (217) 
defines a distribution \l. Then the product (218) is absolutely convergent 

for every u, i.e., the convergent convolution (217) must converge absolutely 
(this agrees with the remark made at the end of § 32 ). It follows therefore 
from (219) that the notation in (217) can be so chosen that 

(220) r n > r^ +1 for every n (r > 0) . 



Since the spectrum of 0 r consists of the two points x = +r, it is 

clear from the description (98) of the spectrum of an arbitrary convergent 
convolution, that a point x = s is in the spectrum S([x) of the distribution 
defined by (217) if and only if there exists to s and to every n a decision 
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of the alternative +r such that s becomes the sum of a convergent 
series of the form 

(221) s=< 2]i r n * 

This implies that S ({J.) is a bounded set on the x-axis if and only if (219) 
is refined to 

(222) r n < 

and that s = ^ then is the greatest, and s = - r^ the least, value 

contained in S(|i) . If, on the other hand, £^r n = o * , then, since r Q — ^0 
in virtue of (219), it is clear that there exists to every real number x 
and to every n a decision on the alternative +r n in such a way that x appears 
in the form Z of a (conditionally) convergent series. Accordingly, 

S([i) is a bounded set or the whole x-line according as the series (221) is 
or is not absolutely convergent. 



53. It turns out that 

(22 3) jj, is a continuous distribution for every choice of the series 

(219) which defines the convergent convolution (217) . 

According to the criterion (188) or to its equivalent formula- 
tion (187), the assertion of (223) is that (21S) satisfies the estimate 

t t 

/TTl c os(r n y))dy = o(t) or J | | cos 2 (r y)dy = o(t) as t «» . 
0 0 



( 224 ) 
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Since every factor of the infinite products n is at most 1 for every y, 
it is sufficient to verify (224) for the case in which the sequence 

is replaced by any of its subsequences, say by its subsequence 
r ]/ > t 2 X * * * * * Actually, if r-^", r ”,... denote the remaining terms of the 
sequence r ]_ > r 2 > • • • > and if U’ and |i" are the convolutions which correspond 
to these two complementary subsequences, then u = [id * |i. n , by (217), (219) 
and (92). It follows therefore from (75) that it is sufficient to prove 
(223) or (224) for the case where (217) is replaced by the convergent con- 
volution [!' belonging to some subsequence r^* , r 2 » ,... of rpT,,,.. . How- 
ever, it will be seen in ^ 60 that the continuity of the distribution u' 
is obvious if the selection of the subsequence r^ 1 ,^ 1 **** defining jx • is 
sufficiently retarded, namely such that r n » becomes greater than the value 
of the remainder series r^^ » + r n+2 ’ + . where n = 1,2,... . If this 
is granted for the present, the truth of (223) follows for every distribution 
(217). incidentally, this proof of (223) can be varied so as to be based on 
a corresponding selection in (224) ; cf . the end of § 64 below. 

According to the definition at the beginning of ^ 57, the distribu- 
tions B r are purely discontinuous. It follows therefore from (148) that the 
distribution (217) is pure. This, when combined with (223), means that 

(225) |i is either absolutely continuous or purely singular. 

Another consequence of (223) is that 

(226) S(fi) is a bounded perfect set or the whole x-axis according 
as I” r n <°° or r n = cx> . 

In fact, S ({.4 ) can, by ^ 4» be defined as the set of those points x 
in some vicinity of which the function |.i(x) is not constant. This clearly 
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ij.ipli.e3 that S([J.) must be perfect whenever u is continuous. Hence, 
(226) follows frora (223) and from what was shown at the end of § 57. 



59. At the beginning of § 36 , there was considered a sequence of 
"spaces” T-pT 2 >... consisting of respective ’’points" t^jt^, . . . . It was 

assumed that every T n carries a measure 0 n satisfying (117). Finally, it 

was assumed, after (121), that every T n carries a ^-measurable function 
x n = x^(t n ) of the position. 

An instructive realization of these requirements can be based on 
the distributions 3 r . In fact, let every T Q consist of only two "points" 
t n . Let these points be denoted by t n = + and t = *- for every n ^md let 
the ^-measure of either of these points be assigned to be i for every n 
(so that (117) is satisfied). Finally, let the function x^v^) of the 
position t n = + on T Q be defined by placing x q (+) = +r n , where r Q is a 
positive number. 

It is clear from the definition (114) of the 0^ -distribution of a 

function x (t n ) on T n that, in the present case, the (^-distribution of 

x (t) on T is 0, | or 1 according as - oo< x < - r -r < x < r or 
n n n n 

r n < x < 00 i that is, the distribution ;3 r defined at the beginning of 

n 

§57. It follows therefore from (138) that the series §7 x (t n ) , i.e., 
2 >n> converges on the infinite product space ... of the points 
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(227) t = (t^lfc,...) = (+,+,...) 
in TT 0 n -roeasure if and only if the infinite convolution (217) is con- 
vergent, i.e., if and only if (219) is satisfied; in which case the dis- 
tribution (217) is precisely the 1 \ (^-distribution of the function 
(223) x(t) : ± r n 

to which converges in \ V d> n -neasur e on the space T = T-^.Tp. ... 

of the points (227) . 

All of this sounds more abstract than it actually is. In fact, 
the infinite product space T = T 2 » •••> consisting of the points (227) 

and carrying the \ j^-measure, can be mapped on the interval 0 < t < 1, 
consisting of real numbers t and carrying the Euclidean Lebesgue measure, 
as follows: 

Let the two points, t = +, of T be thought of as representing 

n n 

the alternative which allots either 1 or 0 to the n-th place in the infinite 
dyadic expansion of a real number t satisfying 0 ^ t < lj so that 
(229) t = ZTt n /2 n and t* = |(l + 1) 

replace the points t = (+, +>...) and t n = + of T = T-^.Tp. ... and of T n 
respectively. Except for those points of the interval 0 < t < 1 which 
possess a finite dyadic expansion (and form, therefore, an enumerable set, 
and hence a set of Euclidean measure 0), this mapping of the infinite 
product space T^.Tp. ... on the interval 0 i t 1 1 is one-to-one. More- 
over, the mapping is measure-preserving, that is, the TT $ n -measure of a 
subset of T^.Tp* ’** an< ^ ^ le Euc - I -^ eaxi measure of the image of this subset 
on the interval 0 ^ t il exist at the same time, and the two measures 
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always have the sane value. This follows from (229) and from the defini- 
tion of the product measure \\ 0 n on T = T-^.T 9 . since the measure on 

T n was assigned by c i> (+) = A-. 

Accordingly, if the terms of the series (223) are thought of as 
functions of the position (229) on the interval 0 1 t < 1, then the series 
(228) converges in "Euclidean measure to a function x(t) on the interval 
0 < t < 1 if and only if the condition, (219), for the convergence of (217) 
is satisfied; in which case the distribution (217) is the ^-distribution 
of x(t) on 0 ^ t 1 1, where (p denotes the Euclidean measure. 

Clearly, (228) and (229) can be written as 

( 230 ) x(t) : |^(1 + l)r n /2 n , where t = \ 2T 0- ± l)/2 n ; (0 < t < 1) . 

If £u) denotes the greatest integer not exceeding u, it is easily verified 
that (230) is equivalent to r 

]2 n tJ 

( 231 ) x(t) : - (~!) r n J 0 < t < 1. 

In fact, the [^t"] -th power of -1 is identical with -(-1)^ for every t 

contained in the interior of the k-th subinterval obtained by dividing the 
interval 0 < t ^1 into 2 congruent subintervals (k = 1,...,2 ). 



60. Suppose that not only (219) but also (222) is satisfied. 
Then the series (231) is uniformly convergent and represents, therefore, 
the function to which it converges in Euclidean measure on 0 <, t < 1. Thus 
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(231) can be replaced by 



(232) 



:(t) = ~y~ (-1) 






V~ 



o < t < i . 



r . where > r < ca • 

n* i — n , 

In view of the identity mentioned after (231) the n-th term of 
the series (232) is discontinuous only at the poinis t = k/2 n of the inter- 
val 0 < t < 1. Hence, every term of the series (232), and therefore, by 
uniform convergence, the function x(t) itself, is continuous at all those 
points t of the interval 0 < t ^ 1 which do not possess a finite dyadic 
exoansion, i.e., which cannot be represented in the form t = t. , where 



(233) tj ffl - j/2 , where j = 1,3>..«>2 - 1 and in = 1,2,... 

The uniform convergence of the series (232) also implies that, 

at the points (233), the limits x(t + 0) and x(t - 0) exist and have the 

values which result if t is respectively replaced by t + 0 and t - 0 in 

each of the terras of the series (232) . But it is clear from the identity 

mentioned after (231) that the difference between the right-hand and the 

r 2 n t ] 

left-hand limits of (-1) at a point t = t. represented by (233) is 

Jlu 

0, -2 or 2 according as n < m, n = m or n > m. Consequently, from (232), 



m 



m-1 



co 



* (t ja + 



0 ) ~ ^ ^ ® + 2 — . 

n=l n=m+l 



This can be written in the form 

(234) x(t jm + 0) - x(t jn - 0) = 2( Vl - 2q m ), 

if q„ is an abbreviation for 

oo 

(235) % = |y +1 r n’ = 



(q o = 51 r n >- 
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6l. Of particular interest is the case of those series (222) 
for which all the jumps (234) are positive: 

(236) q m < Iq^, i.e. q m < r Q , for every m, 

cf. (235). Then the function x(t) is monotone on the interval 0 £ t < 1. 
Tn fact, whether (236) is or is not satisfied, the n-th term of the 
series (232) is constant on each of the 2 n intervals mentioned at the 
end of ^59. On the other hand, all the contributions of all the jumps 
of these step-functions to the function (232) are represented by (233) 
and (234) . Hence, if the function (232) is increasing at all of its 
discontinuity points, i.e., if each of its jumps (234) is positive, then 
the function (232) is non-decreasing at every point of the interval 
0^ t < 1. Furthermore , the function then is nowhere constant, since 
the points (233) » at which its jumps occur, are dense on the interval 
0 < t 1 1. Accordingly, 

(237) x(t) is an increasing function on the interval 0 .< t < 1, if 
(236) is satisfied. 

For a fixed m, there are 2 ra- ~ points (233) > and the jump of 
x(t) is given by (234) at each of these points. Since, as observed 
after (232), the function x(t) is continuous except at the points (233 ) > 
it follows that the sum of all jumps of the monotone function x(t) on 
the interval 0 < t < 1 is . 2 m ~^2 ( q m _ 1 - 2q ffi ), where m runs, as in 

(233) , through all positive integers. Obviously, this sum can be con- 
tracted into 2 ^q - a, if a is an abbreviation for 
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(233) 



lim 2 
n 



n+1 



V 



This implies, in particular, that the assumption (236), which, via (237), 
underlies this calculation, necessitates the existence of the limit (238). 

It is clear from (232) that x(0) = -q Q and x(l) = q Q , where 
q.Q = Hence, the total variation of the monotone function x(t) 

on the interval 0 < t < 1 is 2q Q . Since the sum of all jumps of x(t) 
was seen to be 2qQ minus the value of the limit (238), it follows by 
subtraction that (238) represents the total variation of the continuous 
component of the increasing function x(t) on the interval 0 < t _< 1. 

It will now be shown that 



(240) 



n+1 

meas S(,u) = lim 2 q i n the case (236), 

n-?o* n 



where meas S(^) denotes the Euclidean measure of the spectrum, S(fi), of 
the distribution (217). It will also be proved that 
(24l) S('-i) is nowhere dense in the case (236). 

First, the result formulated before (230), when combined with the 
definition (114) > can be expressed by saying that, even if only (219) is 
assumed, fi(x) is identical with the Euclidean measure of the set of those 
points of the interval 0 < t < 1 at which the value of the function x(t) 
is less than x, where x is any fixed real number. Hence, if the assump- 
tion, (236), of (237) is satisfied, then |i(x) is identical with the length 
of that portion of the interval 0 £ t < 1 on which the monotone function 
x(t) is less than x. Since, as abserved above, x(0) = -q^ and x(l) = q^, 
it follows, in particular, that |a(x) = 0 for - w < x < -q^ and that 

|.l(x) = 1 for q 0 < x <0° . 
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Now consider the remaining interval, -q^ i 2 i q^ } on which the 
interval 0 .< t £ 1 is mapped by the monotone increasing function x = x(t)j 
it being understood that, if t is a discontinuity point of x(t), then the 
subinterval x(t - 0) < x < x(t + 0) of the interval -qQ < x < qQ does not 
contain image points of points of the interval 0 £ t < 1. Since fi(x) is 
the length of that portion of the interval 0 J t < 1 on which the mono- 
tone function x(t) is less than x, it follows that the function ^(x), 

T/here -q _< x <_q^, can be thought of as the inverse of the function x(t), 
where 0 _< t < 1; it being understood that the jumps of the increasing 
function x(t) are mapped on intervals of constancy of the non-decreasing 
function j,i(x). 

The points, (233) » at which x(t) has a jump, (234) > are dense 
on the interval 0 j* t ^ 1. Consequently, the inverse function, |i(x), 
has no jump. Since the assumption (236) can, by (235), be written in the 
form r n > r n+ -^ + r^^ + ..., where n .= 1,2,..., this implies the fact 
anticipated in the proof of (223), if r^,r2>... is identified with the 
sequence r-^' , r^',... considered after (224). 

According to J>4 , the spectrum S([i) consists of those points x 
in some neighborhood of which the function [i is not constant. It follows 
therefore from the above interpretation of ^ as the inverse of the func- 
tion x(t), tb.at S(|i) consists of those points x of the interval -q^ <_ x £ q^ 
which are not contained in any of the subintervals x(t +0) < x < x(t - 0), 
where t denotes any of the discontinuity points, (233) » of x(t) . Conse- 
quently, the Euclidean measure of S(|x) results if the sum of all jumps 

(234) is subtracted from the length, 2q^, of the interval -q <_x < qQ. 
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This, when compared with the result found before (232), completes the 
proof of (240). 

Finally, (241) is clear from interpretation of [)> as the inverse 
of the function x(t), since the discontinuity points, (233)> of x(t) 
are dense on the interval 0 < t < 1. 



62. It is readily seen from the identity mentioned after (232) 
that, if k = l,...,2 n , the total variation of the monotone function x(t) 
on the open interval (k - l)/2 n < t < k/2 n is the same for all 2 n values 
of k and depends, therefore, only on n. On the oilier hand, it is clear 
from § 61 that 

(242) fi(x) = t if x(t - 0) < x < x(t + 0), 

whether t is or is not a continuity point of the monotone function x(t). 
These two facts obviously imply that, if the measure (240) is positive, 
then ji(x) is a constant multiple of the Euclidean measure of the common 
part, S(|.i)E x , of the spectrum, S(j.i), and of the half-line, E x , consisting 
of those points of the x-line which are to the left of the given x. Since 
|j,(oo) = 1, the constant of proportionality must, of course, be the recip- 
rocal value of the measure (240). Accordingly, 

(243) ^(x) = meas(S([J,)E x )/meas S(pt) if meas S(pt) ^ 0. 

It is easy to see that an identity corresponding to (243) holds in the 
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case meas S(p) = 0, if the Euclidean measure occurring in (243) is 
replaced by the appropriate Hausdorff measure. 

The content of (242) and of its extension Just mentioned is 
that the distribution (217) is linear on the set S((.i)j a set which is 
nowhere dense and perfect, by (241) and (226) . A corollary is that 

(244) p is, tinder the assumption (236), absolutely continuous or 
purely singular according as the measure (240) is positive 

or 0. 

In fact, it is clear from (223) that, whether (236) is or is not 
satisfied, \i is purely singular whenever its spectrum is of Euclidean 

measure 0 (cf. § A) • On the other hand, if the Euclidean measure of 
S(p) is positive, and if (236) is assumed, then (243) is applicable. 

Since (243) implies that pt is absolutely continuous, the proof of (244) 
is complete. 

The truth of the alternative expressed by (244) is by no means ob- 
vious. For instance, (244) has no analogue in the case characterized by 
the assumption that 

(245) r m 2 r n for every m, (q n = 2. r < <*>«); 

n=nH-l u 

an assumption which, in view of (235), represents the precise opposite 
of the situation represented by (236) . 

In fact, examples, to be given in § 63 below, show that, if (245) 
is satisfied, p, can be either absolutely continuous or purely singular. 






On the other hand, it is easy to see that 
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(246) S(fi) is the interval -kjq < x i q Q in the case (245). 

Since r^, the assertion of (246) is that every number s 

satisfying r p < s < Z r n admits a representation of the form 

(221). But this is clear from the assumption (245). 



63. In order to illustrate the above results, let the positive 

numbers be so chosen as to form a geometric progression, r n = r n , and 
let the resulting distribution, (217), be denoted by |X r . Thus, from (218) 

(247) M- r (y) = TTcos(r y) . 

The necessary and sufficient condition (219) for the existence of the dis- 
tribution [t is 0 < r < 1, Consequently, (222) must be satisfied. 

Since (235) gives q m = q^r®, where q Q = Z! = ( r _1 _ 1) ^ it is 

clear that (236) is satisfied by r n = r D if and only if r < \\ in which 
case ilie measure (240) is seen to be 0. On the other hand, (245) is 

satisfied by r = r* whenever r > i. It follows therefore from (241), 
n s= * 

(226) and (246), that 

(248) S(|x r ) is a bounded perfect set of Euclidean measure 0 or it is 
the interval -(r" 1 - l)" 1 _< x < (r 1 - l) 1 according as 0 < r < \ 
or ir < r < 1. 
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iA 

If k is a positive integer and if 0 < r < 1, then 0 < r <1 
and, according to (247), 

(249) (y) = H P °(y) ^(r^y) ... ^/(r^ ^y). 

In view of (17) and (53), this can be written in the form 

, N -lA -(k-1) A 

(25°) l*p^(x) = |X r (x) * |i r (r x)* ... ^ r ( r x). 

According to a classical identity, the infinite product (247) 
belonging to r = | is the elementary entire function (sin y)/y (Vieta) $ 
so that 

° - 1 

(251) (y) = y sin y. 

Hence it is seen from (10) that ^ I s a ksolutely continuous dis- 
tribution for which the density is 



( 252 ) 



u »(x) = \ if )x\ < 1 and |x »(*) = 0 if (x\ > 1. 



1/2 * ' * 1 ' ’ 1/2 
It follows therefore from (76) and from the case r = 1/2 of (250), that 



(253) H is absolutely continuous if r = 2 



-l/k 



(k = 1,2,... ). 



It is clear from (223) and (248) that 

(254) is purely singular if 0 < r < 

Since application of (250) to r = 1/4, k = 2 gives 

(255) 

and since (254) implies that the first, hence also the second, distribution 
on the right of (255) is purely singular, the truth of the assertion (77) 

follows from the case k = 1 of (253). 

It is easy to show that, in contrast to (251), 



lim sup u . 
o* r l/m 



(y) > 0 if m = 3,4,... 



(256) 



In fact, it is seen from (247 ) that 

(257) n r °(2*/r j ) = TTcos (2*r n ) if r = l/m, 
whenever m (> l) and j are positive integers. Since 2tt/r^ — > oo as 

j — »• oo , and since the product on the right of (257) is independent 
of j and has, unless m = 2, a positive value, the assertion (256) follows 
from the identity (257) . 

This situation can be imitated in the case in which l/r, instead 
of being a rational integer m, is an algebraic integer of sufficiently 
strong rational affinity. Such an algebraic integer is, for instance, the 
quadratic irrationality 

l 

(258) r = |(5 2 - l)j so that \ < r < 1, 
i 

hence l/r = §(5 2 +1) >1. Here s = l/r and -r are the roots of the equation 
- s - 1 = 0. More generally, let r, where 0 < r < 1, be an irrational 
number such that s = l/r is an algebraic integer having the property that, 
if s,t,...,z denotes the set of all algebraic conjugates of s, then 

| r | < 1, ..., | z / <1 (while s > 1, since s = l/r). Since s n + t + ... + z n 
is a real, rational integer for n = 1,2,..., there exists on the interval 

0 < 9 < 1 a number 9 such that the absolute value of the difference between 

D j] 

s and the real, rational integer closest to s does not exceed a fixed 
positive multiple of 9 n as n — > oo . Hence there exists a positive constant 
c such that 

j-1 

TT l coS (« s ”) I > c TTcos («9 n ) 

n=0 

for every positive integer j. Consequently, if 9, where 0 < 9 < 1, is so 
chosen that cos («9°) / 0 for every n, i.e., that 9 / §•, then the finite 
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product on the left of (259) is not less than the positive constant 



ca, where a denotes the value of the infinite product on the right 
of (259). On the other hand, since r = l/s, it is seen from (247) 
that, corresponding to (257), ^ ^ 

(260) |^ r (#/r ) = («) || cos (*s n ), 

n=0 



where j = 1,2,... . Finally, since r is an algebraic, and a a trans- 
cendental, number, none of the factors of (247) vanishes when y = a, 
o 

and so p r (#) ^ 0. Since the product on the left of (259) was minorized 
by the positive constant ca for every j , and since (i r °(«) ^ 0, it fol- 
lows from (260) that 




in fact, «/r J — * ©* as j — o® , since 0 < r < 1. 

According to (191) and (223), the inequality (261) implies 
that the distribution [J, r is purely singular. This is not contained in 
(254 ) t since some of the r-values satisfying (261) are outside the range 
0 < r < as shown by the example (258). 

For the same reason, it now follows from (248) that there exist 
r-valuas for which the distribution is purely singular but such that 
the spectrum S(^ r ) is an interval (the existence of such distributions 
was observed in $4). The truth of the negation (190) follows not only 
from these examples but from the simpler result (256) as well. 
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64. It turns out that the r-values of the range 0 < r < 1 
which satisfy (26l) are quite exceptional. In fact, it will be shown 
that (26l) cannot hold unless r is so chosen as to fail to possess the 
following property: There does not exist any positive number b such 

that, if djj denotes the absolute value of the difference between b/r 31 
and the integer closest to b/r®, then 

(262) djj < §(1 + l/r) - ^ as n — * 00 , 

In view of (18), the assertion can be formulated as follows: 



If 



(263) 



lira |n r °(y)l=o 

y-* + o* 



does not hold, then there must exist to r some b satisfying (262) . But 
it is known that the set of those values in the interval 0 < r ^ 1 to 
which there exists at least one b = b(r) satisfying (262) is enumerable 
(Pisot). 

It is trivial that, if l/r is rational but not an integer, then 
there cannot exist to r any b satisfying (262). Hence, what will be shown 
implies that (263) holds for every rational r not exempted by (256). 

Let r, where 0 < r < 1, be any value for which there does not 
exist any b satisfying (262). Suppose, if possible, that the assertion, 
namely (263), is false for this r. Then there exist a sequence y]_,y 0 ,... 
and a positive bound c = c(r) such that | Hr°(yj) I > c anl 7 j — ^ 
as j — * 0 ° . In view of (247), the sequence y^yp*.** can be assumed to 
be so chosen that, if k* denotes the (unique) positive integer for which 

k j h k 2 

r < y-j r <1, then the sequence y-]_r , y r ,... tends to a limit. Let 

2 k. 

J 



this limit be denoted by u; so that u 



3 



u, xi u. = y.r 



Since y. 



00 
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it is clear that k- — »o© . But (247) shows that, since u* = y.r 
J J j 

xi— 1c • 

(264) H r °(yj)= I I cos (u^r J ), 

where j is arbitrarily fixed. Since | [t r °(y..) | > c > 0, the absolute 

value of the infinite product on the right of (264) exceeds a fixed 

positive constant. Hence, if this product is treated in the same way as 
the product (260) was treated, it is seen by a straightforward adaptation 
of the proof of (26l) in § 63, that the limit relations u. — ^ u and k- — 4 00 

J J 

contradict the assumption, according to which r is such that there does 
not exist any b satisfying (262). 

This proves, in particular, that the r-values satisfying (263) 
are dense on the interval 0 < r < 1, and therefore on the interval 

0 < r < J. Hence, the truth of the negation (191) follows from (254) • 

On the other hand, the algebraic r-values of the type (258) and 
the rational r-values specified in (256) do not satisfy (263). Further- 
more, it is clear from (249) that, if (263) does not hold for a given r, 

Si 

say for r = r^, then (263) cannot hold for any r of the form r = Tq , 
where a is any rational exponent greater than 1 (so that r^ a < r g) • It 
is not known whether the r-values satisfying (26l) do or do not cluster 
at the upper bound, 1, of the range 0 < r ^ 1. 

Tile above examples imply that the decision of the alternative 
represented by (26l) and (263) depends on delicate arithmetical properties 
of r. It is therefore of interest that, if H- r °(y) is replaced, as in (224), 
by its square ~ average, then all the Diophantine intricacies disappear. 

In fact, it will now be shown that 



( 265 ) 



f, o |2 ,Vlog 2 r 

J I fV (y)l d y = 0(t ) as t — > 

0 

holds for every r£j. Since the logarithm in (265) is of base 2, the asser- 
tion (265) is trivial from (224) if r = The remaining case, where the 
exponent on the right of (265) is negative, is precisely the case in which 
S(n r ) is of Euclidean measure 0; cf. (243). And it is readily seen that 
the Hausdorff dimension. of. S(^) , referred to after (243) , is exactly the 
one corresponding to the exponent log^r mean estimate, (265), of the 

transform o£ [l r . In view of (251) and (252), this parallelism holds in the 
limiting case r = \ also. Correspondingly, it is easy to show that the expo- 
nent on the right of (265) cannot be improved for any r < J. 

In this connection, it is worth observing that both the proof and 
the assertion of (148) remain valid (without any change), if the notions 
of an absolutely continuous and of a purely singular function are referred 
to any Hausdorff measure, rather than, as in $4, the specific Euclidean 
measure. 



In order to prove (265), choose a positive integer k and omit all 

but the first k factors of the infinite product (247). Then, since 

|cos^u \ < 1 and cos u = J(e' i ' U + e iU ), 
t t 

j) | M- r °(y)/ 2 d y < 2 _2k f | 2L e:sp i(+r ±r 2 + ... + r k ) y / 2 dy, 

0 ” k 

where the summation refers to all 2 variations of the k alternatives 

I I 2 

+»•••»+. If the integrand, on the right, | | , is written as a 



product of complex conjugates, then it appears as a linear combination of 
terms of the form exp (icy). The various values of c, along with the 
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number of terms belonging to a fixed value of c, can be obtained by an easy 

counting. Since the assumption 0 < r < \ implies that the sum of the geo- 
ro+1 m+2 m 

metric series r + r + ... is less than r for every non-negative in- 
teger m, it then follows from the last formula line, that 

t ^ 

, Pi o, n i 2 —2k k , "> k— j — j 

(266) I J[.i r (y)| dy < 2 const. (2 t + 2 k /_ 2 r ), 

o 3=0 

where the constant depends only on r. In fact, if exp (icy) is integrated 
between y = 0 and y = t, the corresponding term acquires the factor t or 
the divisor c according as c = 0 or c ^ 0. Since 2r < 1, it is clear from 
(266) that 

t 

r, ,2 _k k 

J I ^ r °(y) I d y < 2 Const, (t + l/r ), 

0 

where the constant depends only on r. Now let k be made a function of t, 
namely the integral part of the quotient (log t)/(-log r) . Then the last 
inequality becomes identical with (265), where 0 ^ r < 



65. Hone of the above distributions is absolutely continuous 
but such as to have a nowhere dense spectrum. Such an example results by 
choosing 

(267) r n = 2~ D + 3" n 

in (217). In fact, it is seen from (235) that (236) is satisfied, and that 

the limit (240) is distinct from 0, in the case (267 ) . Hence the assertion 
follows from (244). 
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Still smoother distributions can be obtained from the identities 
(249) and (251). In fact, these identities imply that 

(268) H r °(y) = 0(|y| k ) if r = 2 ^ t 

where k = 1,2,... . It follows therefore from the criterion (197), that 

the distribution \i has at least k - 2 continuous derivatives for every x 
-l/k 

if r = 2 ' . These r-values tend to 1 as k > o° . But r = 1 is ex- 

cluded by (219), and it is not difficult to prove that none of the distri- 
butions has derivatives of arbitrarily high order for every x. 

In order to obtain distributions (217) possessing this, or even 
a higher, degree of smoothness, denote by X & the distribution which results 
if r^ in (217) is chosen to be l/n a ; so that, from (218), 

( 26 9) ^ a °(y) = "\Tcos(y/n a ), where a > \ , 

by (219) . It will be shown that each of these distributions X (x) has de- 
rivatives of arbitrarily high order for - < x < °° ; that X & is regular- 

analytic and bounded in a strip containing the x-axis, if 3f < a is replaced 
by J < a <. 1; and that X is a (transcendental) entire function, if \ < a < 1. 

Q. 

If 1 < a, then X cannot be regular-analytic at every point of the 
8 . 

line -o»< x <c*> . In fact, if 1 ^ a, then (226) shows that S(X ) is a 

S. 

bounded set, since r = l/n . But the derivative X * (x) vanishes at every x 

Q. 

not contained in the spectrum, and so X ’ (x) cannot be regular-analytic at 

Si 

the points x = + £_l/n , for instance; points which, according to (221), 
are contained in the spectrum. Since the spectrum is situated between these 
two points, it also follows that all derivatives of X & (x) must vanish at 
these extreme points, although X & (x) is not constant there. 
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In order to prove that \ & possesses the smoothness properties 
stated before, it will first be shown that there exists an absolute con- 
stant C satisfying 



(270) |n°(y)| = TTl cos(r y)| < exp(-Cy 2 — r ) 

r a <Vl7l 

o 

for every y, where (y) is the transform, (218), of an arbitrary distri- 

x. 

bution (217) . The existence of the function^ on the right of (270) is 
assured by (219). 

Since |cos(r n y)| <1, the absolute value of the infinite product 
(270) is not increased if some of the factors are omitted. For every fixed 
positive y, omit all those factors in which r n is not less than l/y. Then, 
if a positive constant B is chosen so small that 0 < cos t < 1 - Bt^ for 
0 < t < 1, it follows that 



2 V ' 2 

|H°(±y)| < ~n~ (1 _ Br y 2 ) = exp log(l - Br y 2 ) 

r n <i/y n r n <i/y 

holds for every y > 0. This proves the truth of the inequality for a cer- 
tain positive constant C = Cg, since log(l - t) < -t for 0 < t < 1. 

2 

If [i = \ a , where a > then the sum multiplying -Cy on the right 
of (270) is o* 



l/n a <l/|y I 



/ a»2 ^ -2a -2a 

(l/n ) = Z — n J , Ti du 

, * . a , ,V a 



I y I <n a 



yl 



as y — $>+<». Since the last integral is const.^yj 



-2+1/a 



Tdiere const. = 



-1 



(2a - 1) >0, it follows from (270) that there exists to every a >| a 

positive constant c satisfying 

8. T / 

o ^/ a 

(271) | \ (y) | < esq? (-c |y | ) for every y. 



Consequently, the smoothness properties stated after (269) follow from the 
corresponding sufficient criteria, (198) and (200), of $53. 
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According to (226), the spectrum of the distribution [L = is 

the whole line -oo< x < oo exactly in the case, a <_ 1, in which X a (x) is 
regular-analytic along this line. However, the distribution (217) can be 

so chosen as to be purely singular and still such that its spectrum is the 
whole line. According to (226), (225) and (191), a distribution (217) will 
certainly be of this type if it belongs to a series (219) for which the 

series ^EIr n diverges but, as y— > + oo, the product (218) does not tend 
to 0. And these conditions are satisfied by the sequence r i, r 2>*** which 

consists of the values 1/2 ’, where k = 1,2,..., and contains the k-th of 
these values exactly 2 k * times. In fact, the series ^_r n 2 and 5Ir n 

then become X2 k! /(2 kI ) 2 = ^.l/2 kI < oo and ^L2 kl /2 kl = 1 = oo 



respectively, and so it is sufficient to show that the corresponding 
product (218), namely 



,k» 



(272) fi°(y) = TTcos 2 (y/2 *), 

does not tend to 0 as y — > + oo * But 

I !* 0 (2 m! *) | = IT Ice /' C2" ! ' k! «)|. 

k=m+l 

And it is easily verified that the value of the last product does not tend 
to 0 as m — => oo , the situation being about the same as in (260), (26l), 



y 
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66. The general fact expressed by (225) concerns itself 

with the local behavior of a distribution (217) belonging to an arbitrary 
series (219). As a corresponding general fact concerning the behavior of 
the distributions (217) in the large, it will now be shown that, for an 
arbitrary series (219), the distribution |*(x) must tend, as x — ?+ <=>° 
to f*(+°*) = iKl 1.1) so rapidly that all of its moments are finite and, 
what is much more, 

2 

( T)X 

(273) d|.i (x) < 00 for every p > 0. 

First, the spectrum of the distribution P r , introduced at the 
beginning of £ 57, is a bounded set. Hence, the same holds, by (7l), for 
the distribution (x n defined by 



(274) 



* P r 3 
1 n 



where n and the positive numbers r-^,...,r n are arbitrary. Thus it is clear 



from (30) that all moments of the distribution |j, n are finite. The numerical 

o 

values of these moments can be calculated by observing that, since (y) = 
cos (ry) for every y, the transform of the distribution (274) is the product 
cos(riy) ... cos (r y), by (82). It then follows from (32) that the 2h-th 



n 



h n 



derivative of this product at the point y = 0 is (-1) (|i )^ for every non- 



negative integer h. Accordingly, 
(275) (l^K = ( 



2j 2k 2m 

r n r 0 ... r 



2h ' ' 2j 2k ... 2m 1 2 n 

where the summation, (Z...X ), extends over all sets (j,k, ...,m) of non- 
negative integers subject to the restriction 2j + 2k + . . . + 2m = 2h, i.e., 
j+k+...+m = h. It is understood that the C denote the corresponding 






151 



multinomial coefficients: so the coefficient CL. __ _ results if in 

2J 21c • • • 2m 

(276) C. , = O' + k + ••• + ml), where j + k + ... + m = h, 

j, k, ..., m and h are replaced by 2j, 2k,..., 2m and 2h respectively. 

It is readily seen that C«. „ < h C . Hence, 

2j 2 k • • • 2m — * j • • • m 

jj ll 

(275) shows that (|X ) is not greater than h times a sum which, according 

to the multinomial theorem, is exactly the sum representing the h-th power 

of r-, + ... + r n ”. Consequently, ([X ) < h A > if A denotes the sun of 

h h 

the infinite series (219) . Since the bound h n A is independent of n, and 

since [i* — > |x as n — > o> holds in view of the definitions (274) and (217), 

h li 

it is now clear from (5*) that (}x) < h A . 



This means that ^(x) < (hA) , 7 rhere h s 0,1,... . It 

t _ 



follows therefore from (4*)* by placing t = px^ in = 1 + 21 1 /hi, that 
the value of the integral (273) belonging to a fixed p > 0 cannot exceed 
1 + 2~ (phA) /hi and must, therefore, be finite if the positive series 
2__ (Bh) /hi, where B = pA, is convergent. But Stirling’s formula shows 
that tli is series is convergent if B < l/e. Consequent^, the integral (273) 
must be finite if p < l/(Ae). 

In order to complete the proof of (273) > it remains to be shown 
that the restriction p < l/(Ae) is superfluous. But A denotes the sum of 
the convergent series (219). Hence, l/(Ae) can be made arbitrarily large 
if the sequence r ]_> r 2 >*** is replaced by r n+ ^,r + 2 ,. .., where n is suffi- 
ciently large. Then the distribution (217) becomes replaced by the distribu- 

n 

tion |i n+1 * u ^ • If the latter is denoted by [Xjj, then (X = jx * (X n , 

where |x n is the distribution (274). Bub P- n has a bounded spectrum for 
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every n, and so the integral (273) belonging to [1° is finite for every p. 

Since the integral (273) belonging to a given p and to |A n was seen to be 

n 

finite when n is sufficiently large, the truth of (273) for |i = jj, * |a 

follows by observing that the relevant criterion of § 22 is applicable to 
2 

f (x) = e^ x {for every fixed p > 0) . 



67. The above proof of (273) is to a large extent independent 

of the assumption (217). In fact, the proof remains valid without any 

change, if (217) is replaced by any convergent convolution ja = *|A n in 

which every u has moments of arbitrarily high order and (u ) is less 
r n r n 2h 

than the h-th power of a fixed multiple of (|A^) for all n and for all h, 
finally |A n is symmetric for every n. 

A distribution (A is called symmetric if it is identical with the 
distribution ^|A defined before (17). For instance, an absolutely contin- 
uous ]A is symmetric if and only if it has^density [A* satisfying [A*(x) = fA*(-x). 

It is clear (18) and (19) that a distribution [A is symmetric if 
and only if the transform y,°(y) is an even and/or real-valued function of y 
In fact, (10) becomes 00 

+ J 



(277) 



H- (7) = l- t ( +0 ) " H(-0) 



+0 



cos(yx) dy(x). 



Correspondingly, (185) can be contracted into 



(278) 



y/x+O) + [A (x— 0) _£ 2 J* sjn(xy) 



y (7) dy. 






153 



and so (l95)> If (195) is applicable, into 

09 

(279) [l ' (x) = Jcos(xy) n°(y) dy. 

0 

o 

Since |A is symmetric if and only if |J. is real, it is clear from 
(53) and. (28) respectively, that the convolution of two symmetric distribu- 
tions and the limit of a convergent sequence of symmetric distributions 
are symmetric distributions. 

Finally, it is clear from (30) that a symmetric \i cannot possess 
a non-vanishing moment of odd order. However, this does not imply that 
must be symmetric if all of its moments of odd order exist and vanish; cf. 

$ 18. 



68. In applications of the type exemplified by the theory of 
errors, of particular interest is the case of those densities of probability 
which are monotone functions of the distance from the origin. In order to 
delimit a corresponding class of distributions in such a way that the class 
becomes closed under the limit process specified at the end of § 2, the 
following definition turns out to be appropriate: 

Let a distribution ja be called convex if it is symmetris and 
such that the curve \x = jj,(x) in an (x,p.) -plane is convex from below on the 
open half-line -o»< x < 0, and therefore convex from above on the open 
half-line 0 < x <o> . Since is monotone, hence L-measurable , it follows 
from the theory of convex functions that, if u is a convex distribution. 
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the function pt(x) is absolutely continuous on every open interval not 
containing the point x = 0; a point at which pi can have a jump, as exem- 
plified by the convex distribution (15). It also follows from the theory 
(Jensen) 

of convex functions A that pi has a finite right-hand and a finite left-hand 
derivative at every x ^ 0; that both of these derivatives are monotone 
non-increasing functions of | xj ; finally, that both of these derivatives 
are continuous at exactly those points x ^ 0 at which they have a common 
value (which implies that, since both derivatives are monotone, the set 
of points x at which pi has no ordinary derivative is enumerable) . Both 
derivatives, being monotone, must tend to 0 as x — > + since J dpi(x) <©*>. 

But the derivatives can tend to + as x — * +0. 

If a sequence of func tions which are convex on an open interval 
tends to a limit at every point of this interval, then the limit function 
obviously is convex there. This implies that, if pi n — pi holds in the sense 
defined at the end of § 2, and if every pt^ is a convex distribution, then (A 
is a convex distribution. 

This is the closure property of the class of convex distribu- 
tions, referred to above. It holds only in virtue of the above definition 
of the class, since pi can have a jump at x = 0 when every pi n is continuous 
at x = 0. On the other hand, it is clear that there exists to every convex 
distribution pi a sequence of convex distributions pi n such that every p^ has 
continuous derivatives of arbitrarily high order for every x, and pi^— s pi. 

It will now be verified that, if two distributions, say pi and X, 
are convex, then pi * 1 is a convex distribution. 
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In view of (56), it is sufficient to prove this for the case where p and X 
have continuous second derivatives. Then, since p and X are convex, p'(x), X'(x) 
are even functions which do not increase when (x) increases, i.e., the second 
derivatives p"(x), X" (x) are odd functions which are non-positive for x > 0. 

On the other hand, (81) shows that p * X has the density ^’(x - u)X'(u) du, 
and therefore the second derivative Jp n (x - u)X'(u)du. Since p* 1 is odd and X 1 

is even, this integral, where - o©< u <o° , can be written in the form 

-h co 



Jx'(x - u)p n (u) du = p"(u) ^X'(x - u) - X'(x + y)J- du. 



- CP 



if it is assumed that x > 0. But this implies that the function p"(u) is non- 
positive, and that, since jx-u|<x+u, the function 1 1 multiplying p n (u) 
is non-negative, and so the integrand ^"(u) £ is non-positive, on the whole 
integration domain 0 < u < o° . Accordingly, the function p * X of x has a non- 
positive second derivative for every x > 0. Since p * X, being a convolution 
of two symmetric distributions, is symmetric, it follows that p * X is a convex 
distribution . 

As an illustration, consider the distributions p defined by (202), 

P 

where 0 < p < 2. If p = 1 and p = 2, then (204) and (205) show that p is con- 
vex. In the remaining cases, p was obtained, in § 56, by taking a fixed dis- 

tribution X n , changing in it the unit of length on the x-axis, forming con- 
volutions of the resulting distributions, and finally applying a limit process. 
Since all of these operations preserve convexity, it follows that Pp is con- 
vex if Xp can be chosen to be convex. But the distribution Xp used in § 56 
was the one having the density (215), which is a decreasing function of | x |. 

Consequently, all the distributions p n 
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defined by (202) are convex. 

Since the density (252) defines a convex distribution, it follows 
by the above argument that, if q-j^q^, . . . is any sequence of positive numbers 
for which the infinite convolution *n n , where ^(x) = (j, ^(q^x), is conver- 

gent, then the distribution *y, n is convex. If q n = (n - i) -1 , it is easily 
verified from (251), (17) and fron the canonical products of the even entire 
functions cos z, z~ sin z, that the function \| |A n (y) , i.e., the transform 
of the distribution is identical with the function y\ cos(y/n). Conse- 

quently, the distribution defined by the case a = 1 of (269), is convex. 
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Proj ections 

69. At the beginning of § 34> there was introduced a "space” 

T of "points" t. It was assumed that there is given a field of sets F 
of points t such that every t and T itself are in the field. Finally, it 
was assumed that there is assigned to every set F in the field a value 
0(F) in such a way that 

(280!) 0(F) >0; (280 2 ) ^0(F n ) = 0( Z_F n ); (28O3) 0(T) = 1, 

where F!,F2»... is any sequence of mutually disjoint sets F. 

Xn the sequel, T will be chosen to be the product space 

*1 *k> where k is a fixed positive integer and denote 

the lines - ©»< X! <0® , . .., - co< x^ <©p . In other words, T will be the 
k-dimensional Euclidean space, and t any point (x^,...,xj c ) of this space. 

The field of the sets F will be chosen to be the Borel field on T. Finally, 
any function 0 satisfying the three conditions (280^), (28O2), (28O3) will 
be called a k-dimensional distribution, or simply a distribution (on T) . 

If k = 1, all of this reduces to the situation introduced in § 1. 
However, unless the contrary is stated or implied, it will be assumed that 
the fixed value of k is at least 2. 

The spectrum S(0), the point spectrum P(0) etc. can be defined 
in exactly the same way as in § 4> with the understanding that "interval" 
now means "k-dimensional interval” (i.e., a set of all points (x-^,...,xj c ) 
such that Xj is between a^ and bj, where a^ < bj and j = l,...,k). Cor- 
respondingly, the absolute continuity of 0 means the existence of a (non- 
negative) Baire function f = f (x!, . . . ,x^) having the property that, for 
every Borel set F, the value of 0(F) is identical with the integral of 
f dx! ... dxjj over F; in which case f is called the density of 0. 
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The symbol 0 n — ^0 should be defined in exactly the same way as 
> \x is defined at the end of §2. The notion of the convolution of 
two distributions, a notion depending on the existence of the translation 
group, can be transferred from k = 1 to the case of an arbitrary dimension 
number k. Correspondingly, the definition (10) of the transform of a dis- 
tribution becomes 

(281) 0°( yif«>7k) = J... J* exp i(x 1 y 1 + ... x k y k )drf>, 

where (y-]_, . . • ,y k ) is any point of a k-dimensional Euclidean space? it being 
understood that the integral (281) is a Borel-Lebesgue integral with refer- 
ence to the measure 0 on the (x^ f • • • ) -space, and that the integration 

is extended over the whole of this space. A glance at §7 - ^ 10 shows that 
the proofs of the theorems of concerning the uniqueness, compactness, and 
continuity of the transform remain valid for any k. And the whole theory 
of infinite convolutions, including^iieF semi -groups, can be copied word by 
word. 

The moments of the first and the second order are of course defined 
by 

(282) (0)^ = J*... y Xjd0 and (0) hj . = j\.. ^ *^<30, (j,h = l,...k), 

if y... j' (| x^\ +...+ 1x^1 ) d0 <00 and I — K 2 + — + x^. ) d0 <o» re- 
spectively. In the first case (and therefore, in particular, in the second 
case), the point (x 1 ,...,x k ) = ((0) 1 ,...,(0) k )of the (x ] _,...,x k ) -space is 
the "center" of the "mass distribution" represented by 0. And 0 is called 
centered if its center is the point (x^...,^) = (0,...,0). la the second 
case, it is seen from (280^), (281), (282) and from Taylor’s formula for 
functions of k variables, that 
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(283) f (yi,....y k ) = i + i {te) l7l + ... (<^) 7k J . j^( yi ,...y k ) 

, 2 2 
+ o(y 1 + ... + y k ) 

2 2 

as + ... + y k ^ 0, where denotes the quadratic form 

k k 

(234) V y l , *“ ,y k^ = H X (0) t 

h=l j=l 



'hj y h y j* 



According to (282), this quadratic form is identical with 
2 

( x qy-j_ + ... + x^,y k ) d0. This obviously implies that the form 
(284) is non-negative definite and that its rank is identical with d, if 
the spectrum S(0) is contained in a d-dimensional, but is not contained 
in a (d - l)-dimensional, Euclidean space through the origin of the 
(x^, . . .,x k )— space (it being understood that d = 0 corresponds to the ’’unit 
distribution”, for which S(0) consists of the origin). In particular, (284) 
is positive definite if and only if d = k. 




70. The existence of the moments (282) will not be needed in 
the sequelj so that 0 is an arbitrary distribution on the (x-^, .. .^x^) -space, 
where k has a given value greater than 1. 

Let X: -o*< x <0^ be an oriented line through the origin of 
the (x]_, .. .jX^.) -space. For a given x, let F denote the k-dimensional, 
open half-space consisting of those points t = (x^,...,x k ) for which the 
x-coordinate of the point t^ is less than the given x, where t^. denotes 
the orthogonal projection of t on X. Then it is clear from (280^), ( 28 O 2 ) > 
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x 

(280^) that the function n(x) = 0(F^ ) satisfies the three conditions (8) 
characterizing a distribution on X. This one-dimensional distribution will 
be called the projection of the k-dimensional distribution 0 on X, and it 

will be noted by 0=0 (x) . 

X x 

Thus 0 7 °(y) = J'e^ x ^ r d0(F^. ), by (10) . Hence it is readily veri- 
fied from the definition (28l) and from Fubini's theorem, that 
o o 

(285) 0jr (y) = 0(a 1 7>...»a^y); - oo< y < 0 ° , 



where denote the direction cosines of the oriented line x. 

It is clear from (285) that 

(286) 0^ (y) — 0 (y,C, . . .,0) , . . ., 0^ (y ) = 0 (0,...,C,y), 



where X-^y^X^ denote the coordinate axes in the (x^, ... ,x^) -space. Heed- 
less to say, the k functions of the single variable y do not determine the 
function (28l) of k independent variables. In other words, distinct 

k-dimensional distributions 0 = 0(F) can have one and the same set of pro- 
jections 



(287) 0 (x), 

1 



0 7 (x) 



on the respective coordinate axes X^,... X^. 

(On the other hand, if the projection 0^(x) of 0 is given for 
every X, i.e., if the continuum of all functions (285) belonging to a fixed 
function (281) is known, then the latter, and therefore the k-dimensional 
distribution 0, is uniquely determined. But not even this seems to be 
trivial. Its truth can be concluded from the fact that the solution of a 
certain integral equation of Abel's type is unique.) 
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71. Let R denote a rotation of the (z-^, . . .,x k ) -space about the 
origin, and let RF be the Borel set into which a Borel F is turned by R. If 

a distribution 0 has the property that 0(F) = 0(RF) holds for every F and 



for every R, the distribution 0 will be called of radial symmetry. 

Since the scalar product x-^y^ + . . . + x^y^. remains invariant when 
the (y- L ,...,yi c ) -space is subjected to the rotation R -1 inverse to the ro- 
tation of the (x^, . . •» x j c ) t and since every distribution 0 is uniquely de- 
termined by its transform (281), it is clear that 0 is of radial symmetry 
if and only if the function (281) can be expressed as a function of the 
single variable 



2 2 j 

(288) q = (y^ + ... + y^ ) 2 , where ( )^ > 0. 

On the other hand, the projection 0 = 0^(x) of 0 on the line X becomes in- 
dependent of the choice of the line X, and can therefore be denoted simply 
by (J. = [ Correspondingly, Idle transform of 0^ = becomes independent 
of the direction cosines of X. Consequently, from (285) and (288), 

(289) (q) — 0 (y^, . . . »y k ) . 

If a point t = (x-2 _,...,x^), where k > 1, is thought of as a 
Borel set, and if 0 is of radial symmetry, the three condl tions (280^), 
(280 2 ), (280^) obviously imply that 0(t) = 0 unless t = (0,...,0). In 
other words, the point spectrum, P(0) is either vacuous or consists of 
the origin. Thus, if 0(0) denotes the value of 0(F) when F is the point 
(0 ,..., 0 ), the distribution 0 is or is not continuous according as 0(0) = 0 
or 0(0) >0. In either case, it is clear from the definition of the pro- 
jection | J.0 = V-Ax), that 
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(290) |fy(+ 0) - H (-0) = 0(0). 

But it is also clear that the distribution \x^ is symmetric in the sense 
defined in §67. Consequently, from (277), 

CP 

(291) ^0°(9) = 0(°) + 2 I' cos (qr) d[i^(r) . 

+0 

Let p denote that distribution of radial symmetry on the 

(x^, . ..jX^) -space for which the spectrum S (0) is the sphere 
2 2 

X 1 + .v. + =1. Since the spectrum S(p), being a (k - 1) -dimensional 

manifold, is of Euclidean measure 0, and since the point spectrum P(8) is 

vacuous, the distribution 8 is purely singular. The distribution p will 

2 2 

be called the equidistribution on the sphere x^_ +>.. + Xj c =1. The 

2 2 

equidistribution on the sphere x x + ... + x k = r 2 0 f radius r is defined 
Similarly; it will be denoted by p^. (This notation agrees with the one 
used at the beginning of § 57 in the on e-dimen siDnal case; a case which 
is now excluded.) Thus, from (291), 

( 292 ) ( a -) = (rq)> 

' r p 



where p = (3-^ 

(293) 



and 

H p °(q) = 2 



o© 

f COS (qr) d}Ap(r). 
+0 



For an arbitrary distribution 0 of radial symmetry, let X^(r) 



denote the value of 0(F) when F is the interior of the sphere S(P r ), i.e., 

the domain x^ +...+x^ <r^. Thus \^(r) is a non -decreasing function 

satisfying 



(294) \0(+O) = 0(0) and X ( <y> ) = 1; 
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cf. (290) and (280^). If the distribution 0 is thought of as a strati- 
fication of the possible jump (290) and of the spherical equidi stributions , 
it is seen from (289) and (281) that, in view of Fubini’s theorem, 

< 295 ) 1*0° fa) =0(0) + f °(q) dX.(r). 

+0 ' T' r 



72. The relations (291) and (295) are two geometrical decom- 
positions of the transform (281), represented by (289) and (288), In fact, 
the distribution [A^(x) was defined in § 71 as the projection of 0 on a 
line -c*< x <00 through (x-^,...,^) = (0,...,0); so -that (^(x) is a 

symmetric distribution which is continuous except for a possible jump, 

. . holds 

(290) , at the origin, and so ^(r) = 1 - ^(-r)„for every r > 0 (cf. the 

definition of |x in § 6). Accordingly, the difference ^(r) - ^(-r) is, 
on the one hand, identical with 2|a^(r) - 1 and represents, on the other 
hand, the value of 0(F) for that Borel set F which is the h-dimen sional 
strip contained between two parallel hyper planes tangent to the sphere 
of radius r about the origin. Hence, if the factor 2 in (291) is moved 
beneath the integral sign, it is seen from 26 .^^) = d - lj that 

(291) represents the decomposition of the transform (289) into cylindrical 
waves. On the other hand, the definition of X^(r) at the end of § 73 



shows that (295) is the decomposition of the same transform into spherical 
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waves. Originally, the transform was given by the k-fold integral (281) 
in the form of plane waves. 

Either of the weight functions, 2\x^(r) - 1 and ^(r) , of the 

cylindrical and spherical stratifications determines the other uniquely. 

The explicit connection between these two functions is 

o° 

(296) 2^( r ) - 1 = X^(40) + Jh (r/p) dX.(p); 

+0 p 

cf. (294) • In fact, it is clear from (292) and (10) that |ip(x/p) is the 

2 2 2 

projection of the equidistribution carried by the sphere x^ '+...+ x^ =p, 
since ^(x) is the projection of the equidistribution carried by the sphere 

2 2 

*1 + — + *k =1. Thus (296) is evident from the geometrical interpre- 

tation given before, the two evaluations being identical in view of Fubini's 

theorem. Correspondingly, (296) follows from the uniqueness result of § 7 

also, if (296) is substituted into (291) and the resulting representation 

o 

of (q) is compared with the one which results by substituting (292) and 
(293) into (295). 

The functidn occurring in (296) can be given explicitly, as fol- 

i(k-3) 



lows: 

(297) |X g (r) = 



i + b. 



4 a - 



dx if 0 £ r < 1, 



1 if 1 < t <o» , where B 1 , = » a P (7*:)/ P (yk - l), 

fiS A * 

k being the dimension number. In fact, B = 8(F) is the equidistribution 

2 2 

on the sphere x^ + ... + x^ = 1, and [x^(x) I s ihe one-dimensional distri- 
bution representing the projection of 3 on a line - Oo < x <<3® through 
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(x^*...*^) = (0,...,0). Hence, d|4.p(x) is proportional to the (k - 1)- 

diinensional area of that portion of the sphere which is situated between 
the two hyper planes perpendicular to the x-line and proceed by the latter 
at the points x and x + dx. Thus d^Cx) =0 if either - o* < x < -1 or 
1 < x , while d| 4 ,p(x) = const, sin^ ^ 0 d0 if -1 < x < 1, where 0 de- 

notes the azimuth in the (x-^, .. ,,x k ) -space with reference to the negatively 
oriented x-axis; so that x = - cos 0. Since this gives d|ig(x) = const. (l - x^) 
for -1 < x < 1, the assertion (297) follows, except for the numerical value 
of const. = B^. But the latter is determined by (3)j so that, correspond- 

ing to (215. bis), 1 

r 0 %(k-i) 1 

(297 bis) 1^ = 2 ] (1 - x 2 ) dx = P (|k - -I)/ r dk ) . 

0 

Substitution of (297) into (293) gives 

o 1 0 §(k-l) 

(292) ^ (q) = 2B k j (l - r 2 ) cos (qr) dr. 

0 



This corresponds to (216) and can, corresponding to (216 bis) , be expressed 



in terms of the cylinder function 

1 



(298 bis) 



(|s) h 



a 



1 

r 



r ft + j) 



h-i 



(1 - s^) cos (zs)dxj h > -§■. 



In fact, h = Jrk - 1, and so, from (297 bis), 



(299) 



pk-1 



^ (q) = 2 " P (§k) Ja^CoJ/q^ 1 . 



(incidentally, the function on the right of (299) becomes cos q in the case 



k = 1 considered at the beginning of § 57; a case which is now excluded.) 

If k = 3, then (299) is reduced to the function (251), where y = q. If 

o 

k - 2, then (299) becomes just |ig = J . 



t\3|H 
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i 

"“2 

Since J-^(q) = 0(q ) as q — > cp holds for every h > -§•, it follows 

from (299) that 

(300) Up (q) = 0(q * 2 ) as q * zp . 

The estimate (300), when compared with (297), illustrates the conclusion 
drawn from (197) in § 53. 

If (299) and (292) are substituted into (295), where 0 is any 

k-dimensional distribution of radial symmetry, it follows that 

o » 

(301) U, (q) = 0(0) + p (Jk)(Jq) " k f r 1 (qr)dX.(r) . 

+0 2 J£ ~- l ^ 

Accordingly, the k-fold integral (28l) is reduced by (239) and (288) to the 
integral (30l). Incidentally, (290), (291) and the parenthetical remark 
following (299) show that (301) is true in the excluded case k = 1 also, 
since then 0 = in virtue of § 5. 

In view of (29l) and (293 bis), the relation (30l) can be interpreted 
as the harmonic analysis (Fourier transform) of the connection between the 
two functions of stratification, [A^(r) and ^(r), belonging to a distribu- 
tion 0 of radial symmetry. This connection, namely (296), can be written 
in a form as explicit as (301). In fact, }Ag(+0) = \ and |j,g(oo) = by 
(297). If this is combined with (294), a partial integration shows that 



the sum on the right of (296) is identical with 

OO Co 



i- i: 



J^0(p)dp^(r/p), i.e., \ - Bjj. j\^(p)(l - t ? /?) (-r/p^dp. 



2 , , 2 , 



hy (297) ; so that (296) appears in the form 



( 3 02 ) 



- 3 

2 V 



=f(, 



(p 2 - r Z ) i(k ' 3) 
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In view of (8), the numerator of the quotient on the left of (302) tends 
to 1 as r — ^ o® . 

In connection with (302), cf. (310) below. 

If the function is given, the function follows from (302). 

In the converse problem, where \x^ is given, (302) represents a functional 
equation for the unknown function X^. It is clear that the solution of this 
functional equation can be reduced to a differentiation if k = 3» and to 
Abel's integral equation, that is, to a differentiation of order |, if 
k = 2. And it is easy to verify by successive partial integrations and 
by differentiations of integral order, that the solution of (302) in case 
of an arbitrary k can be reduced to the case k = 3 or to the case k = 2 
according as k is odd or even. Correspondingly, the function (298 bis) 
occurring in the harmonic analysis (301) of (302) can, by successive 
differentiations of integral order, be reduced to the elementary function 
or to the less elementary function (resulting from by a differ- 
entiation of order -§) , according as k is odd or even . 

Needless to say, the integral equation (302) cannot have a non- 
negative. non-decreasing solution l^(r) satisfying ^(o*) = 1» unless the 
given function, ^(r), is such as to represent the projection of a k- 
dimensional distribution of radial symmetry. 
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73. Let f(t) = f be a real -valued Baire function of 

the position on the k -dimensional Euclidean space T = X^ on 

which there is given a distribution, that is, a set-function 0 satisfy- 
ing (280^), (2802)* (280^). The function f can be undefined or + Qo 
at some points (x^, . . . However, the set, say F^., of these points 

should be of 0-measure 0, i.e., such that 0(F^) = 0. Then § 34 assigns 
to f a 0-distribution, [,t = |J.(x), as follows: |i(x) is the 0-measure of 
the set of those points (x-^,...,x 2 ) at which f(x-^,...,3^) is less than x. 
(it is understood that the line X: - 00 < x <0° cannot be thought of as 
situated in the (x^, .. .jX^) -space and must not, therefore, be confused 
with the line considered at the beginning of £ 70). 

Now let the given function f be 



k 




("Student's ratio"). The denominator, | j 2 , vanishes on a (k - l)-dimen- 
sional manifold which represents the set F^. mentioned before. Accordingly, 

the distribution 0 must be chosen such that this manifold, which is inde- 
pendent of 0, is of 0-measure 0. It is understood that k is assumed to be 
at least 2 (otherwise the denominator of (303) vanishes identically). 

Suppose now that 0 is of radial symmetry. It will be shown that 
the 0-d.istribution of the function (303) then is independent of the choice 
of 0. The explicit form of this common 0-distribution, a distribution which 
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depends only on the dimension number k and can therefore be denoted by 



= a^(x) , will be a by-product of the proof. It turns out that 
the absolutely continuous distribution having a density proportional to 




(1 + x^) 

(304) 






for - o®< x <o° j so that, from (215) and (215 bis), 



<V( X ) ~ A k( 1 + x2 ) * » where = # * ["* (§k + |)/ p (|k) 



» 



It must of course be assumed that the manifold, F^, on which the 

denominator of (30 3) vanishes is of 0-measure 0. But 0 is of radial sym- 
metry^ and so it is clear that 0(F^.) = 0 if and only if 0(0) = 0, where 0(0) 
denotes, as in § 71, the only possible jump of 0. Accordingly, the as- 
sertion is that the 0-distribution of (303) has the density (304) for every 
distribution 0 which is of radial symmetry and such that 0(0) =0. 

Let N denote the line which passes through the origin of the 
(x^, ... ,x^) -space and is normal to the hyper plane pierced by the x^-axis 
at the point x. = 1 of the latter, where j = l,...,k. Then the k direction 

_i 

cosines of N have a common value, k ^ (> 0, if N is oriented from the origin 
toward the hyper plane) . Hence, if R^ denotes any rotation of the 
(xq_, . . . jX^J-space about the line N, a rotation which is uniquely determined 
only when k = 2, then the k-rowed orthogonal matrix defining has k equal 
elements, k 2 , in its k-th row. Accordingly, if (*x^,.. , ‘x^) denotes the 
space into which the space (x^,...,x^) is rotated by a given R^, then 

k k i 

(3° 5 i) *35^ = ZI. k r 'Xj, i.e., ^ x^ = k 2 . •x fc . 

j=l j=l 
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Thus the square of the denominator of (303) is 1/k times 



k 

Z < x i 

i=l 



k~st 






x. - 



1 

2(k 2 'x, 



) k 



1 



+ k 






v 



i=l 



But the first sum on the right is invariant under any rotation, and so the 
whole expression on the right is identical with the sum of the squares of 
the first k - 1 of the k coordinates 'x^. Accordingly, the denominator of 



(303) is 
(305 2 ) 



k-1 



I I = I 



J=1 



Hence, if the numerator is expressed in terms of (305]_)> it follows that 
the function (303) is transformed by any rotation % into the function 

(306) •f('x 1 ,...,'x k ) = 'x k /(* X;L 2 + ... + 'x^ 2 ) 2 

(in the sense that f (xj_, . . .,x k ) = ’f ( ‘x^, . . ., *x k ) in virtue of Rjj). What 
will be essential in the proof of (304) is the fact that none of the k 



coordinates 'xj occurs in the numerator and in the denominator of (306). 

Let a set of points in the k-dimensional space be called a half-cone 
if it consists of a set of half-lines ending at the origin (so that "half- 
cone" means "half-cone issued from the origin"). Since the ratio (306) 



remains unchanged if each of the k coordinates *Xj is multiplied by an 
arbitrary positive number, it is clear that, if C x denotes the set of 
those points at which the value of the function (306) is less than a given 
number x, then C x is a half-cone. It depends on the choice of the rota- 



tion Rjg turning the original coordinate system, (x 1 ,...,x k ) i n to the 
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coordinate system, ( 'x^, . . . , ‘x^.) , which occurs in (. 306 ); so that 
C = C However, since 0 is of radial symmetry, and since the 

function (306) is identical with the function (303) in virtue of R^, 
the 0-measure of (^(P^) is independent of in fact, it is identical 
with a^(x), if a^x) denotes the 0-measure of the set of those points 
(x^,...,Xj c ) at which the value of the function (303) is less than x. 

How let M be any (oriented) line through the origin. Let the 
coordinate system (x-^,...,x k ) be rotated into a coordinate system, say 
(v^, ... >^.) , which relates to the line M in the same way as the line N 
relates to the coordinate system (x^, . . .,x k ) . Finally, let ( 'v^, . . ., 'v^) 
denote the coordinate system belonging to the coordinate system 
(v 1 ,...,v k ) in the same way as ( 'x-^, .. ., ‘x^) belongs to (x 1 ,...,x k ). 

Then, since 0 is of radial symmetry, the 0-measure of the set of those 

, \ 2 2 i 

points ( 'u-j_, . .. , 'u^) at which the function ’ u } c /( ,u -|_ + ••• + ,u k_i ) , 

which corresponds to (306), is less than x, has the same 0-measure as 
before, namely ct^x). Since this holds for any line M through (0,...,0), 
and since the group of all rotations about (0,...,0) is transitive, it 
now follows, by one more application of the radial symmetry of 0, that 
o^x) is identical with the 0-measure of the half -cone C (r) for any 
choice of the rotation R which turns the coordinate system (x-^,...,x k ) 
into the coordinate system ('x^, ..., ‘x^) occurring in (306) . Before this 
consideration, only rotations R of the particular type R^ were admitted. 

Since C (R) can be referred to an arbitrary R, it is now seen, by 
changing the notions in (306), that the function o^x) , originally defined 
as the 0-distribution of (306), is the 0-distribution of 
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(307) g(x 1 ,...,Xj £ ) _ x k /(x 1 + ... + ) 2 

as well. In fact, all that this means is that a^(x) is identical with 
the ^-measure of the half -cone on which the function (307) is less than x. 
Let this half-cone be denoted simply by C . Then, if the set RF defined 
as at the beginning of § 71, the radial symmetry of 0 = 0(F) implies 
ihat (^(x) = 0(RC x ) holds for every rotation R, since a^(x) = 0(C ). 

It is clear from (220^) that o^(x) = 0(RC x ) cannot hold for every 

v x 

R unless a (x) is a constant multiple of meas c , where c denotes the 

2 2 

set of those points of the sphere x-^ + . . . + x^ =1 which are on the 

X X 

half-lines constituting the half-cone C , and meas c is the (k - 1) -di- 
mensional surface measure of c . But C x was the set of those points 

(x-^,...,Xj £ ) of the k-dimensional space at which the function (307) is less 

x 2 2 

than x. Hence, c is the set of those points of the sphere x^ + ... + x^ 

2 1 

at which the function x^./ (l - x^ ) 2 is less than x. Since the inequality 
2 i 

Xk/d - % > 2 < x can be written in the form x^ < s, where 



(302) 



s = x/(l + yr) z t 



it follows that a^(x) is a constant multiple of the (k - l) -dimen Sion al 

2 2 

surface measure of that portion of the sphere x^ + ... + x^. =1 which 

is to the left of the hyper plane x^ = s, if the x^-axis is thought of as 
oriented from left to right. But it was seen in the proof of (297) that 
this surface measure is a constant multiple of 

(309) J (1 - ^ ) 

-1 
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if -1 < s < 1. Since (308) defines a one-to-onecorresoondence be- 
tween the whole x-axis and the interval -1 < s < 1, it follows that 
a^Cx) is a constant multiple of (309), and therefore the derivative 
a^'tx) is a constant multiple of (l - s^) 2 ^ ^ s i for - oo< x <&> 

This gives a k ’(x) = const. (1 + x^) 2 , if s = s(x) and s' : = s'(x) 
are substituted from (308). Since the numerical value of the factor 
of proportionality is determined by (3), the proof of (304) is complete. 



74. The connecting link, (308), between the projections (297) 
of the spherical equidistributions and the distributions defined by (304) 
suggests for the latter distributions the following geometrical interpre- 
tation: Let the (x-^, .. ,,x^) -space be thought of as Riesann's spherical 
space on which x-^,...,x-j { . are his Cartes5.an normal coordinates, mapping the 
'’sphere" conformally on the Euclidean realization of the (x-, ,... ,x, ) -space; 

so that the coefficient matrix, (g^), °f the squared line element becomes 
.2 2-2 

(1 + X]_ + ... + ) times the unit matrix, if the unit of length is 

normalized the diameter of the "sphere". Since the square root of det (g. .) 

~ k 2 2 ^ 

is (1 + r^) , where r = x-^ ’+...+ x fc , the Riemannian volume element 

2 —lr 

is (l + r ) “ times the Euclidean volume element dxn ... dx, . Consequently, 
if "equidistribution" is meant in the sense corresponding to the definition 
of the distribution 3 introduced after (291), then the equidistribution 
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on the Riemannian space is represented by that absolutely continuous 

distribution on the Euclidean (x^, . . . , 2 ^) -space which has a density 
, 9 -k 

proportional to (1 + r^) . But the projection of this k -dimensional 

distribution on a line - o©< x < o® through the origin ( 0 ,..., 0 ) is 

precisely the distribution a^x) found in § 73 . 

In order to verify this, let 0 denote any absolutely continuous 

2 22 

distribution of radial symmetry, and letd\(r), where r = (x, +...+X, ) 

0 -L Tc > 

denote the density of 0. According to the definition of the projection 
= n 0 (x), introduced after (288), the value of the distribution for 
a given x results if <f^(r) dx-^ ... dx^ is integrated over the half -space 
Xj^ < x; so that d|j, 0 (x), the value of 0 = 0(F) for the infinitesimal 
strip F contained between the hyper planes x^ = x and x^ = x + dx, is 
dx times the integral of <f^(r) dx^. ..dx^ over the whole (x^ . . . ,x^ 1 )- 

2 2 2 1 

space. Since r - (xj_ + ... + x-^^ + x^ )^ ^ f 0 n 0ws that the one- 

dimensional distribution H-^(x) has the density 

M- 0 1 ( x ) = /••• J <f Vp 2 + x 2 ) dx 1 ...dx k _ 1 , where p = (xj 2 + ... + x^ 2 ) 2 , 

But this p represents the radial polar coordinate in the (k - 1) -dimensional 

Euclidean space over which the integration is extended; so that dx-^. . .dx^..^ 

k -2 2 2 

is p times the surface element of the sphere x-^ + ... + x^ ^»l(if k > 2 ) . 

Consequently, 

(310) (x) = const. I* (T (Vp 2 + x 2 ) p 'dp. 



where the const, is the (k - 1) -dimensional Euclidean surface area of the 

sphere (if k > 2; and const. = 2 in the limiting case k = 2). Incidentally, 

made of (315) 

this surface area can be ealculated from (310) itself* IF use is below. 
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In the Riemannian case considered above, the density 



. p ~ k 

was seen to be a constant multiple of (1 + r ) . Hence, the asser- 

tion to be verified is that (310) becomes identical with (304) in this 

case. But the truth of this becomes clear by observing that, if 

a (r) = Const. (1 + r*") , the replacement of p by the integration 

0 

variable q = p 2 /(l + x 2 ), where x is fixed, transforms the integral 

(310) into a constant multiple of (1 + x^) 2 . 

Thus (304) and the derivative of (297) are the densities of 

the projections of the equidistributions carried by the k-dimensional 

2 2 

spherical space and by the sphere x^ + . . . + x^ =1 respectively. 

2 2 

Consider now the equidistribution on the sphere x-j_ + ... + x^ <1. 

This is the k-dimensional distribution 0 having a density cf^(r) which 
is positive constant or 0 according as r ^ 1 or r > 1, Hence, the inte- 
gral (310) becomes 0 or a constant multiple of the integral of p^~^dp 
over the interval p <_ (l - x 2 ) 2 , according as \ x\ > 1 or (x| < 1. 

Accordingly, the density of the projection ^(x) of the equi- 

2 2 

distribution 0 carried by the sphere x-^ + . . . + xjj. < 1 is 0 or a con- 

2 l-(k-l) 

stant multiple of (1 - xj according as \x \ > 1 or \ x\ < 1. On 

the other hand, (297) states that the density of the projection ^(x) 

2 2 

the equidistribution ,8 carried by the sphere Xi '+...+ x. = 1 is 0 or 

2 1 (k-3 ) 

a constant multiple of (1 - x ) 2 according as | x( > 1 or \ x \ < 1. 

Since the exponent, §-(k - 3)> in the latter density, becomes identical 
with the exponent, |(k - 1), in the former density if the dimension num- 
ber of the (x^, . ..jX^) -spade is increased by 2, it follows that the 
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projection of the equidistribution carried by the sphere 



2 

*1 + 



+ 



2 



x k 



< 1 



is identical with the projection of the equidistribution carried by the 
2 2 

sphere + ... + x h =1, where h = k + 2. 

Actually, the identity of these two projections is just a re- 
statement of a classical theorem in multi^dimensional stereometry. The 

2 2 

limiting case k = 1, where the sphere x^ + ... + x^' < 1 is the interval 

-1 < x-^ < 1, represents Archimedes 1 theorem, according to which the sur- 

? 2 2 

face area of a sector cut off from the sphere x^ + ^ ’ + = 1 by a 

plane x^ = x is proportional to the altitude 1 + x of the sector, where 
-1 < x < 1. In fact, if k is replaced by h = k + 2, then the function in 
the first line of (297) becomes linear in the case J(h - 3) = 0. 



75. Instead of k-dimensional distribution <i> of radial symmetry, 

consider for a moment the k-dimensional distributions 0 which are product 

the 

distributions. The latter are defined by the property tha^ 0-measure on 
the (x-^> . • .jX^J-space is the product measure (in the sense of (119) and 
(120), where t. = x., n = k and 0 n = 0) of the projections of 0 on the k 
coordinate axes Xj . It is clear from (118) , (120) and from th j unique- 
ness theorem of the transform (281) , that a distribution 0 is a product 
distribution if and only if 



(311) 0 (y 1 ,...,y k ) -$ o (7 1 ,0,... t 0)<b O ( 09729 ' 9 ' 9 0) ... 0 °(o,o,...,y k ) 
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holds for (28l) as an identity © n the (y-i > . . .>7, )-space. In fact, the 

projections of 0 on the coordinate axes and the transforms of ihese pro- 
jections are given by (287) and (286) respectively; so that (311) is 



equivalent to 

(312) 0 (yp...,^) = 0^ (y x ) 0^ (y 2 ) ... 0 °(y k ). 

(Let 0 (x) denote the one-dimensional distribution representing 
the projection of the k-dimensional product distribution 0 on that line 
through the origin of the (x^ . . .,x k )_ sp ace which has the direction 
cosines a. = k 2 for j = l,...,k. Thus 0 (x) is the 0-measure of the 

J w 

half -space x^+... + x k <xj cf. (305-^). On the other hand, (285) and 
(312) show that 

(312 bis) 0 Q (k 2 y) =0^ (y) 0^°(y) ... 0 °(y) . 

This, when compared with ( 82 ), explains, via (123), the geometrical mean- 
ing of the convolution process.) 

After this digression, consider again a question involving 
radial symmetry. 

It is shown in the elements of the kinetic theory of gases that 

an absolutely continuous product distribution having a differentiable 

density cannot be of radial symmetry unless the density is a constant 

, 2 . 

multiple exp(-r ), if the unit of length is suitably chosen. It is easy 



to prove that this holds without any assumption of continuity, except 
that one has then to include the trivial case of the "unit distribution" 
(for which case the spectrum consists of the origin). 
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In fact, suppose that 0 is a product distribution of radial sym- 
metry. This means that, if q is defined by (288), both (289) and (311) 
are valid. Hence 



yj yj o O n 

(3!3) 1*0 (q) = P-0 (y x ) (y 2 ) ... P^ (y k ), Where q = (y x ' + ... + y fc 2 ) 



y q q 

It follows therefore from (12) that [J.0 (q) = ^ (y^) a (y^) , where 



2 — 
^ s 



q ~ (7i + yp ^ • This can he written in the form 



(3%) f(q x + q 2 ) = f(q x ) f(q 2 )j (3H,,) f(q) = [^(q*), 

where the q are arbitrary non-negative numbers. It follov/s that there 
cannot exist any q = q Q satisfying f(qQ) = 0. This is clear from (314j) 
and (12), if q^ = 0. If, on the other hand, q^ > 0, then successive 
application of (341^) shows that f (q^) = 0 implies f (q^/2 ) = 0 for n = 
1,2,...; so that the zeros of f(q), and so, by (314 ? ), the zeros of (j. (q) , 
cluster at q = 0. But this contradicts (12), since, as proved in § 10, 
the function is continuous. Thus f(q) is a continuous function which 
does not vanish for any q. In addition, f(q) is real-valued in view of 
(291) and (314 2 ). It follows therefore from (12) and (314 2 ) that f(q) 
is a continuous function which is positive for every q. Hence, (314j_) 
can be written in the form g(q x + q 2 ) = g(q 1 ) + g(q 2 ), where g(q) = log f(q) 
is a real -valued continuous function. Consequently, there exists a real 

constant c such that g(q) = cq, i.e., f(q) = exp(eq). 

o p 

This means, by (314 2 ), that \x^ (q) = exp (cq ). it follows 
therefore from (11) that c cannot be positive. Consequently, either c is 
0 or the unit of length can be so chosen that c becomes -J; cf. (17). 



ro|H 
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In the first case, y. (q) is the constant 1. But then (15), (289) and 

(281) imply that the spectrum S {(b) consists of the single point (0,...,0). 

o 2 

In the second case, (q) is identical with exp(-Jq ), which means, by 
(154) » that f x. is the distribution 9 having the density (205 bis). But 
then <t> is the product distribution of k projections (287) each of which 
has the density (205 bis)j so that is the k-dimensional distribution 
having the density 

(315) - (2 «) exp (-Jr )j where r = (x^ +• ... + x^'). 



This is Maxwell's "proof" for the validity of his law of the 

distribution of the velocities in a free ideal gas (the dimension number, 

k, is 3, the dimension number of the natural (x,y,z)-space) . A more 

realistic approach is developed in Boltzmann's Vorlesungen. He considers 

a system of a large number of identical free molecules. Thus, if k is 

the degree of freedom, then, since the conservation of the energy means 

the conservation of the kinetic energy, the velocity components, say 

2 2 

x^,...»3Cj c , are subject to the relation x^ + ... + x^ =1, if the units 
are suitable chosen for every fixed k. Correspondingly, the energy sur- 

2 2 2 

face would appear in the form x^_ + ... + x^ = k , if the units were 

chosenthe same for k=l as for anyk. The resulting discrepancy, introduc- 

1 

ing the reduction factor k" in the linear dimensions, will become funda- 
mental in a moment. 

2 2 

Consider the equidistribution on the energy surface x^ + ... + =1. 

According to (297), the projection of this equidistribution has the density which 

2 b(k-3) 

is equal to B^(l - x ) or to 0 according as x is or is not in the 
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range -1 < x < 1. Hence it is clear that, if the radius of the sphere 
is changed from 1 to k 2 , then the density of projection becomes 
“c 2 | (k-3 ) 

(l - x /k) or 0 according as x is or is not in the range 

ii 1 

-k 2 < x < k 2 . Since k 2 — > as k — ^ o® , and since Stirling’s (or 

Wallis’) formula implies that the numbers defined by (297 bis) 

_i _i_ 

satisfy the limit relation B^k 2 — > # 2 , it follows that, as k — 

_i p 

the density belonging to the altered units tends to # 2 exp(-x~) uni- 

_i 2 

formly on every fixed bounded x-interval. But n 2 exp(-x ) is a den- 
sity which, except for a fixed change of the unit (in the proportion 

l 

1 : 2 s ), is identical with the density (205 bis). 

_1 - 1 - 

Inciden tally, it is seen from (304) that A k k . — 9 a 2 , and that, 
in view of the structure of the function (304), the preceding approach 

to (205 bis) can therefore be applied without any change, if tne k-dimen- 
sional sphere is replaced by the k-dimensional spherical space, considered 
in § 73. This latter approach to (205 bis), when translated into the 
language of the statisticians, means that the distribution of Student’s 
ratio (303) for a large normal sample (x^...^) is approximately normal, 
and teat Student's result (304) is exact for a small normal sample as well. 



181 



REFERENCES 
Monographs 

Aitken, A.C., Statistical Mathematics, University Mathematics Texts, 
no. 2, Edinburgh, 1929. 

Livy, P., Calcul des probability. Paris, 1925. 

, Thlorie de 1* addition des variables aleatoires. Paris, 1937. 

Kolmogoroff, A., Grundbegriffe der Wahrscheinlichkeitsrechnung. Er- 
gebnisse der Mathematik und ihrer Grenzgebiete. Berlin, 1933, 
vol. 2, no. 3. 

Plummer, H.C., Probability and Frequency. London, 1940. 

Saks, S., Theory of the integral. Monografie Matematyczne, vol. 7. 
Warsaw, Lwow, 1937. 

Wintner, A., Lectures on asymptotic distributions and infinite con- 
volutions. Ann Arbor, 19 38. 

The following notes are not of course intended to be complete. Their 
main purpose is to refer the reader to the original papers containing the 
principal general results on the one hand, and examples of theoretical interest 
on the other hand. 

§ 26 - §41. The "Three Series Criterion" (§ 33 ) ior convergence 
almost everywhere on the infinite product space is the fundamental theorem 
of A. Khintchine and A. Kolmogoroff, Recueil Soc. Math, de Moscou, vol. 32 
(1925), pp. 668-677. Essential applications and interpretations of this 
general theorem were given by P. Levy, Studia Math., vol. 3 (1931) > pp. 119-155. 
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L4vj has, among other things, the fact that, in case of infinite product 
measures, convergence in measure implies (and is therefore equivalent to) 
convergence almost everywhere . In the language of infinite convolutions, 
this becomes the criterion given in §33', cf. B. lessen and A. Wintner, 

Trans. Amer. Math. Soc., vol. 32 (1935) , pp. 42-58. In case of a finite 
number of dimensions, the corresponding coalescence (viz., the identity of 
"independence" with "additive independence") does not take place; in this 
regard, cf. P. Hartman, E.R. van Kaiapen and A. Wintner, Amer. Jour, of Math., 
vol. 6l (1939), pp. 47V-4S6. The theorem of ^41 was proved by lessen and 
Wintner (loc. cit.). 

Historically, the whole development was initiated by Borel's formula- 
tion and proof of his "either 0 or 1" theorem (Rend. Palermo, vol. 29 (1909), 
pp. 247-271). Today it is easy, but at that time it was a true mathematical 
achievement, to think of the ordinary measure on the interval 0 < x £l as a 
product measure in an infinite product space, the factors being the binary 
spaces corresponding to the dyadic expansion of x. It may be mentioned that 
not only the general theory of "random variables" but also Birkhoff's ergodic 
theorem are patterned after Borel's theorem, which indeed is a particular 
case (with metrical transitivity) of the ergodic theorem. 

$42 - $48. Under the restriction to finite second moments, the 
determination of the Fourier transforms of semi-groups was carried out by A. 
Kolmogoroff , Rend. Accad. Lincei, ser. 6, vol. 15 (1932), pp. 805-808 and 
866-869. The general theorem, that concerning arbitrary semi-groups, is due 
to P. Bevy, Ann. di Pisa, ser. 2, vol. 3 (1934) > PP* 1934} vol. 4 (1935) > PP* 
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217-218. The above proof is based on the considerations used for finer pur- 
poses by P. Hartman and A. Wintner, Amer. Jour, of Math., vol. 62 (1940), pp. 
757-779. 

§49 - §52. Concerning the oldest criteria for the validity of Fourier's 
inversion formula, cf. the historical comments of A. Pringsheim, Jahresber. 

Math. Ver., vol. 16 (1907), pp. 2 - 16, according to which the first criterion 
applicable in case of conditional convergence is one given by A. narnack (1885). 
The criterion of $ 52 for continuity is nothing but the mathematical physicists' 
definition of a wave-length contained in the line spectrum; it may go back to 
Stokes (or to the school preceding him in optics). In connection with distri- 
bution functions as such, it was pointed out by P. 4lvy£2 3, pp. 169-172. The 
Stieltjes form, (185), of Fourier's inversion and, first of all, the fundamen- 
tal completeness theorem of § 9 are also due to him (loc. cit.; cf. Comptes 
Rendus, vol. 175 (1922), pp. 854-856 and Prace Mat.-Fiz., vol. 39 (1932), pp. 
19-28). 

The criteria (198), (200) for a high degree of smoothness were adapted 
to infinite convolutions by A. Wintner, Amer. Jour, of Math., vol. 55 (1933), 
pp. 309-331 and vol. 56 (1934), pp. 659-663; for further applications, cf. B. 
Jessen and A. Wintner, loc. cit. (and, in the latter connection, E. K. n aviland 
and A. Wintner, Puke Math. Journ., vol. 2 (1936), pp. 712-721). The existence 
of the symmetric "stable" distributions, those assigned by (202), was tacitly 
assumed by Cauchy for 0 < p < oo , and only L^vy observed the necessity, and 
has proved the sufficiency, of the restriction p _< 2; for references, cf. A. 
Wintner, Amer. Jour, of Math., vol. 58 (1936), pp. 45-90, where a sharper re- 
sult is proved. The expansion of £ 55, where 0 < p < 1, was given in Duke 
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Math. Jour., vol. 8 (1941), pp. 678-681. 

§57 - § 68 . In § 59 , the mapping of the unit interval on the infinity 
product space of alternative choices is the idea which, in connection with 
Borel's paper, was pointed out above. This, the simplest, particular case 
of the general theorem of Khintchine and Kolmogoroff (1925) has been dealt 
with by R.E.A. 6 . Paley and A. Zygmund, Proc. Cambr. Phil. Soc., vol. 26 (1930), 
pp. 337-357. The infinite product (218) as such was introduced as a fairly 
general source of instructive examples (and counter-examples) by B. lessen and 
A. Wintner, loc. cit., where the results of §58 were also given. To 60 - 
§62, cf. F. Hausdorff, Math. Annalen, vol. 79 (1919), pp. 157-179; the ap- 
plication of Hausdorff measures to (217) was given by R. Kershner and A. Wintner, 
Amer. Jour, of Math., vol. 57 (1935), pp. 541-548. Those results of §63 which 
concern an irrational r were proved by P. Erdfis, ibid., vol. 6 l (1939), pp. 
974-976; cf. also vol. 62 (1940), pp. 180-186). As to § 64 , cf. N. Wiener sad 
A. wintner, ibid., vol. 60 (1938), pp. 513-522 and Jour, of Math, and Phys. 
(M.I.T.), vol. 17 (1939), pp. 233- 246 . The estimates of § 66 were given J^with 
the unnecessary restriction to the case (217)] by A. Khintchine, Math. Annalen, 
vol. 96 (1927), pp. 152-168. 

§ 69 - §75. The consideration of projections of distributions of 
radial symmetry goes back to Boltzmann's approach to the velocity distribu- 
tion in ideal gases, subsequently considered by Borel. The above treatment 
is that given by P. Hartman and A. Wintner, Amer. Jour, of Math., vol. 62 
(1940), pp. 757-779, where detailed references will be found to the literature 
of the subject. 
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Student's distribution is usually deduced as a property of normal 
distributions, rather than, as in §73 f as a purely geometrical fact valid, 
for reasons of homogeneity, for any distribution of radial symmetry (whether 
the distribution be a product measure or not). For the corresponding metrical 
interpretation in terms of Hiemann's spherical geometry, cf. A. Wintner, 
Astronomical Papers Dedicated to Elis Strflmgren, Copenhagen, 1940, pp. 286-297. 



